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ABSTRACT 

Very-long-baseline  interferometry  (VLB1)  is  a  technique  which 
potentially  can  measure  intercontinental  distances  with  1  cm 
precision.  Already,  measurements  have  been  made  which  have 
nearly  reached  this  goal,  if  we  accept  the  standard  deviations 
calculated  from  statistical  arguments.  However,  the  sources 
of  errors  in  these  measurements  must  be  carefully  analyzed  and 
accounted  for  to  ensure  reliablity  in  the  interpretation  of 
results  obtained  from  VLBI  data  analysis. 


In  this  thesis,  we  concentrate  on  examining  both  the  theory 
and  the  quality  of  the  VLBI  delay  measurements.  Using  the 
redundant  nature  of  the  VLBI  observations,  we  develop  a  number 
of  quality  tests.  These  tests  are  applied  to  data  taken  in  VLBI 
experiments  between  July  1980  and  January  1982.  All  of  these 
tests  lead  to  similar  qualitative  conclusions  about  the  performance 
of  the  Mark  III  VLBI  system.  There  are,  however,  differences 
from  each  of  these  tests  in  the  quantitative  conclusions  about 
the  system  performance.  These  tests  indicate  that  the  actual 
statistics  of  the  VLBI  group  delay  observations  do  not  match  the 
theoretically  calculated  statistics.  \  The  differences  between 
the  actual  and  theoretical  statistics^ are  of  two  types.  Firstly, 
there  appears  to  be  a  proportional  error  in  the  calculated 
standard  deviations  of  between  1.1  and  1.2,  which  is  probably 
due  to  correlator  non-reproducibility  and  the  effects  of  instru¬ 
mental  dispersions.  Secondly,  there  appears  to  be  a  threshold 
performance  limit  which  can  not  be  penetrated.  This  threshold 
limits  the  accuracy  of  the  X-band  (a?8.34  GHz)  group  delay 
measurements  to  between  0.040  nsec  (» 1  .  2  cm)  and  0.015  nsec  (s0.5  cm), 
depending  on  the  experiment.  The  S-band  (*2.3  GHz)  system 
appears  to  have  a  performance  limit  of  0.15  nsec  (S?4.5\m)  for  all 
of  the  experiments  analyzed.  \ 

We  have  also  used  data  from  the  VLBI  experiments  conduced 
between  July  1980  and  June  1983,  to  estimate  the  distances 
between  radio  telescopes  in  North  America  and  Europe.  The  estimate 
of  the  Westford,  Massachusetts  to  Onsala,  Sweden  baseline  length, 
obtained  from  &0  experiments  using  these  sites,  has  a  weighted 
root-mean-square  repeatability  of  2.0  cm  and  a  statistical  standard 
deviation  of  0.6  cm.  However,  studies  of  these  solutions  indicate 
that  the  actual  standard  deviation  of  the  baseline  length  estimate 
is  probably  more  in  accord  with  repeatability  rather  than  with 
the  statistical  estimate  of  its  value  obtained  from  the  analysis 
of  the  ensemble  of  data.  The  estimate  of  the  rate  of  change  of 
the  length  of  this  baseline  is  1.6  +  0.5  cm/yr,  which  is  consistent 
with  the  inferred  average  rate  of  1.7  cm/yr. 


The  determination  of  the  tidal  parameters  of  the  earth  are 
also  investigated.  The  results  obtained  are  consistent  with  the 
currently  accepted  values  for  these  parameters. 
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I ntroduction 


In  April  1968  the  first  bandwidth-synthesis,  very-long- 
baseline-  inter  ferometry  (VLBI)  experiment  was  conducted  using 
the  NRAO  Mark  I  recording  system  (Whitney,  1974)  .  Two  experi¬ 
ments  later  in  January  1969  the  bandwidth  synthesis  technique 
was  successfully  used  to  measure  accurate  group  delays  (errors 
<10  nsecs  or  »3  m  effective  path  length) .  The  January  1969 
experiment  yielded  estimates  of  the  relative  station  locations 
of  two  radio  telescopes,  one  in  Westford,  Massachusetts,  the 
other  in  Green  Bank,  West  Virginia  which  were  separated  by 
approximately  eight  hundred  kilometers.  These  estimates 
agreed  with  conventional  survey  results  to  within  2  m  in  the 
distance  between  the  sites  (or  baseline  length)  and  within  5  m 
in  orientation.  Thirteen  years  after  the  first  successful 
experiment,  baseline  lengths  of  over  eight  thousand  kilometers 
are  now  routinely  being  measured  using  the  Mark  III  VLBI 
system  with  precisions  of  a  few  centimeters  —  unmatched  by 
any  conventional  surveying  techniques.  In  this  thesis  we  will 
investigate  the  accuracy  of  the  results  achievable  with  the 
currently  available  Mark  III  system  and  the  improvement  we  can 
expect  in  the  future. 

The  measurement  of  group  delays  gave  the  bandwidth 
synthesis  VLBI  system  a  major  advantage  over  other  VLBI 
systems  available  in  1969.  Prior  to  this  time  the  accuracy  of 
the  group  delay  measurements  had  been  limited  by  the  bandwidth 
of  the  recording  systems.  The  maximum  recorded  bandwidth 


available  was  4  MHz  for  the  Canadian  long  baseline  interferom¬ 
etry  (LBI)  system.  The  NRAO  Mark  I  recording  system  was 
capable  of  recording  bandwidth  of  only  360  kHz.  The  bandwidth 
synthesis  system  (in  1969)  measured  group  delays  by  physically 
changing  the  frequency  of  the  second  local  oscillator  in  the 
receiver  and  sequentially  sampling  the  output  of  the  receiver 
with  the  narrow  bandwidth  recorder.  This  method  allowed  a 
large  bandwidth  to  be  sampled  ( ="spanned")  while  recording 
with  a  narrow  bandwidth.  The  accuracy  of  the  group  delay 
determination  then  became  a  function  of  the  spanned  bandwidth 
rather  than  the  recorded  bandwidth.  With  the  Mark  III  VLBI 
system  it  is  no  longer  necessary  to  switch  the  frequency  of 
the  local  oscillators  because  signals  from  up  to  28  channels, 
each  of  2  Mhz  bandwidth  can  be  recorded  simultaneously, 
eliminating  the  need  to  switch  frequencies. 

Even  before  the  first  successful  bandwidth  synthesis  VLBI 
experiment,  the  applications  of  such  a  powerful  geodetic 
technique  were  being  envisaged  (Shapiro,  1968;  Shapiro  and 
Knight,  1970).  Potentially,  intercontinental  baseline  lengths 
could  be  measured  with  precisions  of  a  few  centimeters.  The 
applications  of  such  results  would  be  numerous.  For  the  first 
time,  it  would  be  possible  to  determine  the  current-day  plate 
velocities  with  a  time  resolution  of  only  a  few  years.  These 
rates  could  be  measured  between  points  thousands  of  kilometers 
away  from  active  spreading  centers  or  convergence  zones, 
thereby  being  immune  to  any  local  effects  at  these  centers  or 
zones.  Also,  the  technique  could  be  used  to  monitor  earth 
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rotation  and  polar  motion  with  unprecedented  precision  and 
time  resolution.  It  was  envisaged  that  the  position  of  the 
rotation  axis  with  respect  to  the  crust  of  the  earth  could  be 
determined  with  a  precision  of  better  than  10  cm  every  dav 
The  system  would  provide  an  almost  inertial  reference  frame 
because  of  the  great  distances  to  the  quasars  which  were  being 
observed.  After  several  decades  of  operations  the  precession 
constant  and  terms  in  the  nutation  series  could  be  accurately 
measured.  This  information  would  then  be  useful  in  constrain¬ 
ing  earth  rheology  and  structure  models. 

For  all  of  the  above  goals  to  be  realized  three  factors 
are  essential:  accurate  observations,  accurate  models  and 
appropriate  statistics  to  account  for  any  inadequacy  in  the 
models.  In  this  thesis  we  will  concentrate  on  the  first  of 
these  factors.  We  commence  in  Chapter  2  with  a  detailed 
review  of  the  VLBI  observables.  The  fundamental  observable  of 
VLB I  is  the  phase  of  the  cross  spectrum  of  the  data  recorded 
at  two  sites.  However,  this  observable  is  not  readily  used 
because  in  general  we  have  no  way  of  determining  the  integral 
number  of  cycles,  or  ambiguities,  associated  with  the  phase 
observation.  The  bandwidth-synthesis  technique  overcomes  this 
problem  by  using  the  group  delay,  i.e.  the  derivative  of  the 
phase  with  respect  to  (angular)  frequency.  We  will  investi¬ 
gate  the  nature  of  these  observables  and  in  particular 
consider  the  quantities  which  affect  the  phase  observable  and 
hence,  in  general,  the  group  delay.  Also  in  Chapter  2  we  will 
discuss  the  phase-calibration  system.  We  will  associate  this 


particular  calibration  with  the  observations  themselves 
because,  when  available,  it  is  automatically  applied  during 
the  processing  of  the  recorded  data.  The  term  observation  has 
now  implicitly  come  to  mean  "phase-calibrated  observation." 

In  Chapter  3  we  will  consider  methods  of  assessing  the 
accuracy  of  the  VLBI  observations.  Specifically,  we  will 
consider  closure  of  observations  around  triplets  of  baselines, 
analysis  of  the  behavior  of  the  residual  phase  in  each  fre¬ 
quency  channel  and  the  behavior  of  the  phase-calibration 
system. 

A  major  advance  in  eliminating  the  random  error  and 
minimizing  some  of  systematic  errors  in  VLBI  observations 
would  be  achieved  by  eliminating  the  phase-delay  ambiguities. 
With  current  Mark  III  observations  from  two  widely  separated 
frequencies,  the  group  delays  should  be  sufficiently  accurate 
to  allow  this  elimination  (observations  made  at  two  widely 
seperated  frequencies  allow  the  plasma  delays  to  be  estim¬ 
ated)  .  This  problem  is  dealt  with  in  Chapter  4.  In  many 
respects,  the  investigations  of  phase-delay  ambiguity  estima¬ 
tion  can  be  considered  as  accuracy  checking.  We  start  by 
evaluating  the  approximations  made  in  using  dual- frequency 
data  to  calibrate  for  the  delay  through  the  ionosphere.  We 
then  investigate  how  these  formulas  should  be  applied  to  VLBI 
data.  To  check  the  algorithms  used  to  remove  ambiguities,  we 
have  applied  them  to  data  taken  on  pairs  of  sources  which  were 
observed  repeatedly  over  many  hours.  With  such  closely  spaced 
observations  we  are  able  to  "phase  connect"  the  data  without 
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use  of  the  group  delay.  We  are  then  able  to  compare  these 
connected  phase  delays  with  the  predicted  ambiguities  based  on 
the  group  delays. 

The  studies  of  phase-delay  ambiguity  prediction  have 
produced  a  technique  which  can  be  used  to  check  the  accuracy 
of  the  VLB I  group  delay  measurements.  We  then  investigate  in 
Sections  4.3  and  4.4  the  accuracy  of  the  group  delay  measure¬ 
ments  from  both  the  phase  connected  experiments  and  a  geodetic 
schedule  (i.e.,  a  sequence  of  observations  which  are  used  to 
determine  estimates  of  geodetic  parameters) . 

In  Chapter  5,  we  will  summarize  the  estimates  of  baseline 
lengths  obtained  from  VLBI  experiments  spanning  the  interval 
from  May  1978  to  June  1983.  In  addition,  we  will  also  present 
estimates  of  the  Love  numbers  h  and  *  and  the  tidal  lag  angle. 

The  weight  of  this  thesis  may  imply  that  all  aspects  of 
radio  interferometry  are  addressed  in  this  volume.  This 
thesis,  however,  will  address  only  topics  which  are  applicable 
to  geodetic  uses  of  wideband,  very-long-baseline  interfero¬ 
metric  observations  of  extragalatic  radio  soucres. 

Many  of  the  questions  which  VLBI  measurements  will  be 
able  to  answer  are  related  to  geophysics.  Consequently,  this 
thesis  will  not  only  be  directed  towards  experts  in  VLBI,  but 
also  towards  the  potential  users  of  the  measurements  obtained 
with  VLBI  —  earth  scientists.  Users  of  the  system  must  be 
aware  not  only  of  the  system's  advantages  but  also  the 
system's  disadvantages  and  limitations.  This  need  is  most 
evident  in  the  interpretation  of  the  uncertainty  of,  and  the 
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correlation  between,  results.  It  is  critical  that  VLBI  users 
(actually  users  of  all  advanced  geodetic  techniques)  be  aware 
of  underlying  assumptions  used  in  data  acquisition  and  proces¬ 
sing,  especially  when  these  assumptions  may  introduce  correla¬ 
tions  between  results  or  unrealistically  small  uncertainties. 
Throughout  the  thesis  a  conscious  effort  will  be  made  to  make 
all  asumptions  and  their  consequences  explicit.  In  this  way 
it  is  hoped  that  the  full  capability  and  limitations  of  the 
Mark  III  VBLI  system  can  be  understood  by  all  readers. 


THE  VLB I  OBSERVABLES 


VLBI  measurements  can  be  used  to  obtain  accurate  esti¬ 


mates  of  geodetic  parameters,  e.g.  the  relative  positions  of 
radio  telescopes  and  radio  sources  and  the  location  of  the 
earth's  rotation  axis  with  respect  to  the  crust  of  the  earth 
and  the  source  coordinate  frame.  Analysis  of  changes  in  these 
geodetic  parameters  over  a  period  of  time  could  be  used  to 
answer  questions  about  the  dynamics  of  the  earth's  crust  and 
the  properties  of  the  earth's  interior.  However,  the  esti¬ 
mates  of  the  geodetic  parameters  are  themselves  the  end  result 
of  a  process  which  contains  many  steps.  In  this  chapter  we 
investigate  the  first  step  of  this  process  -  the  inference  of 
the  values  of  the  VLBI  observables.  For  the  geodetic  applic¬ 
ations  of  VLBI  these  observables  consist  of  group  delay,  phase 
delay  and  phase  delay  rate.  Although  these  quantities  will  be 
referred  to  as  the  VLBI  observables,  the  measurements  made 
during  an  experiment  consist  of  recordings  of  radio  signals 
from  extragalatic  radio  sources.  These  recordings,  which  not 
only  contain  signals  from  the  radio  source  but  also  noise  from 
the  radio  receiver  and  the  regions  surrounding  each  antenna, 
are  not  used  directly  to  estimate  geodetic  parameters,  but 
they  do  contain  information  which  is  used  for  this  purpose  in 
a  later  stage  of  the  analysis. 

The  information  we  can  use  is  the  difference  in  the 


arrival  times  of  the  signals  from  the  radio  source  at  two 
sites.  We  would  expect  that  the  time  of  arrival  of  the 
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signals  from  the  radio  source  would  be  a  function  of  the 
location  of  the  radio  telescopes  with  respect  to  the  radio 
source  (Figure  2.1).  When  two  radio  telescopes  observe  the 
same  source  we  should  be  able  to  compare  the  recordings  from 
the  two  sites  and  deduce  the  difference  in  arrival  times  of 
the  signals  at  the  two  telescopes.  In  addition,  since  the 
radio  telescopes  are  located  on  a  rotating  earth,  we  should  be 
able  to  measure  the  rate  of  change  of  the  arrival  times.  This 
latter  quantity  is  the  phase  delay  rate.  But  how  do  we  obtain 
optimum  estimates  of  these  quantities  from  the  data  recorded 
at  each  telecope?  To  answer  this  question  we  will  need  to 
know  the  nature  of  the  signals  being  received  from  the  radio 
source  and  the  nature  of  any  noise  sources.  We  will  study 
this  problem  in  Section  2.1.  In  Section  2.2  we  will  investi¬ 
gate  the  methods  used  to  infer  the  values  of  the  VLBI 
observables  from  the  recordings  at  each  radio  telescope,  and 
in  Section  2.3  we  will  investigate  the  phase  calibration 
system  which  is  used  to  calibrate  the  propagation  delay 
through  the  receiver  system  of  the  radio  telescope. 


2.1  The  properties  of  the  signals  recorded  during  a  VLBI 


observation 


In  this  section  we  will  investigate  two  types  of  radio 
emission  which  are  encountered  in  geodetic  VLBI  observations 
of  extragalatic  radio  sources.  These  two  types  are  thermal 
emission  and  electron  synchrotron  emission.  The  thermal 


gure  2.1  Generalized  and  simplified  geometries  of  a  VLB!  observation 
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emissions  are  generally  associated  with  noise  sources,  e.g. 
receiver  noise,  cable  losses  and  atmospheric  emission,  whereas 
the  synchrotron  emission  is  the  dominant  emission  process 
occurring  in  the  radio  sources  (generally  quasars)  which  are 
observed.  In  studying  these  mechanisms  we  will  mainly  be 
interested  in  two  of  their  properties  --  the  statistical  prop¬ 
erties  of  the  electromagnetic  field  components  of  the  emission 
and  the  power  spectral  density  of  the  emission.  The  power 
spectral  density  is  of  interest  to  us  because  its  Fourier 
transform,  when  normalized,  will  be  the  autocorrelation 
function  of  the  signals  in  the  time  domain. 

Thermal  emissions  in  all  bodies  are  due  to  changes  in  the 
quantum  states  at  random  times.  If  the  body  is  a  perfect 
absorber  (and  hence  a  perfect  radiator) ,  it  is  referred  to  as 
a  blackbody  and  the  brightness  of  the  radiation  from  the  body, 
Br,  at  a  given  frequency  is  a  function  only  of  the  temperature 
of  the  body  (See  Table  2.1.1  for  definitions  of  brightness 
and  other  quantities,  as  given  by  Krauss  (1966)).  This 
relationship  is  described  by  Planck's  radiation  law 
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where  h  is  Planck's  constant  (6.63x10  Joule  sec),  f^  is 

8  —  1 

frequency  (Hz),  c  is  the  velocity  of  light  {*  3x10  m  sec  ), 

-23  -1 

k  is  Boltzmann's  constant  (=1.38x10  Joule  K  ),  and  T  is 
the  temperature  of  the  body  (K) . 

This  type  of  radiation  characterizes  many  of  the  noise 
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Table  2.1.1  Definitions  of  Radio  Source  Properties 


2  2 

Brightness  Br  (w/m  -Hz-rad  ) 


Power  dW  =  Br  cos©  dQ  dA  dv  (w) 


Spectral  power  dw  =  Br  cos0  dQ  dA 
(w/Hz) 

Flux  density  F  =  / /  B  (0,$)  dQ 
sourcer 


Power  received  per  unit 
area  per  unit  solid  angle 
per  unit  bandwidth 
power  received  in  bandwidth 
dv  from  solid  angle  dQ 
incident  on  area  dA  with 
angle  0  between  normal  to 
surface  and  direction  to 
emission 

power  per  unit  bandwidth 
flux  density  of  source 


(w/m2 -Hz)  =  (1026  Janskys) 

Power  pattern  Pn(0,<{>)  response  of  an  antenna  to 

(dimensionless)  power  from  direction  e,<|> 

normalized  to  1  at  Pn(0,0) 

Observed  flux  density  FQ  =  ff  Br  ( e  r  4> )  Pn  ( ©  r  <J> )  dQ 

source 


(if  source  is  small  and  Pn(0,$)  *  1,  then  FQ  ®  Br  Qg  where 
Q_  is  the  angular  extent  of  the  source) 

S 


Poynting  vector  P  *  (ExH) 
2 


(ws/m  )  (MKSA  units) 

Electric  field  strength  E  v/m 
Magnetic  field  strength  H  a/m 
For  time  harmonic  fields  P(u)  =  E(u>)xH  (u>) 
P(u>)  *  2F  if  E  and  H  are  interpreted  as 
field  densities 

Rayleigh-Jean ' s  Law  (for  f<<kT/h  and 
black  body  radiation) 

Br  =  2kT/X 2 


power  flow  through  unit 
area 


-v..- 

^  v  . 


.y-.~v 
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sources  encountered  in  radio  astronomy.  Since  we  are  gener¬ 
ally  interested  in  determining  if  a  signal  is  greater  than  the 
noise  level,  it  has  become  common  practice  to  associate  emis¬ 
sions  of  all  types  with  an  equivalent  blackbody  temperature. 
However,  this  temperature  can  only  be  associated  accurately 
with  a  physical  temperature  for  blackbodies. 

We  can  deduce  some  statistical  properties  of  this  type  of 
radiation  based  only  on  the  nature  of  the  emission  process. 
Because  the  radiation  is  due  to  the  sum  of  the  emissions  from 
individual  electrons  which  are  changing  quantum  states  at 
random  times  we  would  expect  the  components  of  the  electric 
field  (E  field)  to  be  random.  In  addition,  since  the  compo¬ 
nent  of  the  E  field,  at  a  given  frequency,  will  be  the  sum  of 
the  emissions  from  many  individual  electrons  changing  state, 
we  can  use  the  central  limit  theorem  of  probability  to  deduce 
that  these  components  will  be  Gaussianly  distributed.  (Since 
the  E  field  probability  distribution  is  probably  different  for 
each  state  change  we  need  to  use  the  relaxed  version  of  the 
central  limit  theorem  which  does  not  require  identical  proba-. 
bility  distributions.  This  substitution  is  valid  since  no 
single  electron's  emission  contributes  nearly  all  of  the  total 
energy  radiated.)  Since  the  spectrum  of  the  emissions  has 
components  at  high  frequency  (see  Planck's  radiation  law),  we 
also  know  that  the  signals  will  have  a  very  short  correlation 
t  ime . 

Electron  synchrotron  radiation  is  the  other  major  emis¬ 
sion  process  we  will  consider.  Any  accelerating  free  electron 
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will  emit  electromagnetic  radiation.  Any  free  electron  moving 
in  a  uniform  magnetic  field  will  travel  in  a  helical  path 
(neglecting  collisions  between  the  electron  and  other  atomic 
particles).  If  the  electron's  velocity  is  much  less  than  the 
speed  of  light,  the  radiation  pattern  of  the  emission  will 
extend  over  a  large  angular  range  perpendicular  to  the 
acceleration  vector  of  the  electron  (Figure  2.1.1a).  When  the 
velocity  of  the  electron  is  near  that  of  the  speed  of  light, 
the  radiation  from  the  electron,  as  seen  from  the  frame  of  the 
observer,  will  appear  to  "beam",  i.e.  radiate  in  a  very  narrow 
cone  in  the  instantaneous  direction  of  motion  of  the  electron 
(Figure  2.1.1b).  We  now  investigate  the  nature  of  this  type 
of  emission. 

If  the  observer  is  in  the  plane  of  the  motion  of  the 
electron,  he  or  she  will  observe  a  short  pulse  of  radiation 
each  orbit  of  the  electron.  In  Table  2.1.2  some  of  the  major 
formulas  associated  with  synchrotron  radiation  are  given 
(compiled  from  Jackson  1975,  Chapter  14).  Since  the  emission 
from  the  electron  is  periodic  (assuming  negligible  energy  loss 
and  hence  ensuring  that  each  pulse  is  nearly  identical)  ,  the 
power  spectrum  of  the  emission  will  only  have  components  at 
discrete  frequencies,  i.e.  the  emission  will  not  have  a  con¬ 
tinuous  power  spectrum.  The  frequency  spacing  in  the  spectrum 
will  be  the  inverse  of  the  orbital  period  of  the  electron. 

For  the  case  given  in  Table  2.1.2,  the  frequency  spacing  will 
be  approximately  1.4xl0-^  Hz.  From  the  duration  of  the  Dulse 
we  can  also  calculate  approximately  the  frequency  component  in 
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Figure  2.1.1  Radiation  patterns  from  accelerating  electrons 
in  a  uniform  magnetic  field 


v 


a.  Radiation  pattern  when  v<<c 


Table  2.1.2  Synchrotron  radiation  equations  (MKS  units) 


For  an  electron  with  rest  mass  mQ  moving  at  velocity  v  in  the 

region  of  a  uniform  magnetic  field  §m  we  have  (see  Figure  2.1.1) 

Energy  of  electron  Z,  =  m^  c2y 
-2  2  2  ° 

where  y  =l-v/c  ,  c  =  velocity  of  light 

Radius  of  orbit  p  =  mocy/(eBm)  ,  e=  charge  of  electron. 

Beam  width  0  =  l/(2y) 

2 

Pulse  duration  t  =  m  /(2eB  v  ) 

-  p  o  m 

Energy  radiated/ (rad/sec) -unit  solid  angle 


I 


e2c 

3*2c 


(<i>)  + 


(l/y)2+  p2 


where  e  is  the  charge  of  an  electron,  u>  is  frequency  (rad/sec)  , 
p  is  the  angle  between  the  observer  and  the  direction  of  the 
instantaneous  velocity  of  the  source,  *2/3  and  Kl/3  are  A*ry 
functions  and 


+ 


3/2 


For  w  <<  1/t 


I|P=0  =  e 


2C  [ii£]2/3  [I'_(|/3)_]2  (3/4)l/3 


Example  For  £  =  1013eV  =  l.  6xlO_®J,  Bm  =  10-<*  Gauss=10  8Wb/m2, 

e  =  1.6x10  19  Coul.,  m  =  9.1xl0-31  kg,  c=  3xl08  m/sec, 

9  25  °  9 

d  =  10  ly  (10  m)  and  w  =  50x10  rad/sec,  we  have 

y  =  1. 95xl07 ,  p  =  3.33xl012  m,  0  =  2.56xl0-8  rad  (5.3  mas), 

tp  =  7.5xl0“18  sec,  orbit  period  =  7.0x10^  sec 

I,.  =  8.8xl0~2*  Joule/ (rad/sec) -rad2 

lp~°  -24  2 

Average  power  radiated  each  orbit  =  7.2x10  w/Hz-rad  . 

9 

10  light  years  from  the  source  the  flux  density  from  this 

-  7  5  2 

single  electron  would  be  dF  =  7.2x10  w/m  -Hz  and  hence  the 
number  of  electrons  necessary  to  produce  a  1  Jansky 
do'26  w/m2-Hz)  source  is  1.4x10^. 


the  spectrum  with  the  maximum  power.  This  frequency  for  an 

13  .  17 

electron  with  an  energy  of  10  ev  is  1.3x10  Hz;  see  Table 
2.1.2.  (We  have  taken  a  maximum  value  for  the  energy  of  an 
electron  in  a  quasar  to  ensure  that  the  calculated  number  of 
electrons  is  a  minimum  so  that  we  will  be  sure  that  the 
application  of  the  central  limit  therom  will  always  be  valid 
(see  discussion  later  in  this  section).) 

In  Table  2.1.2  we  calculate  the  number  of  electrons 
needed  to  produce  a  radio  source  with  a  flux  density  of  a  1 

9 

Jy,  10  light  years  from  the  observer  (a  typical  distance  for 

47 

quasars) .  The  estimated  1.4x10  electrons  is  certainly  an 
underestimate  because  we  assumed  all  electrons  were  beaming 
directly  at  the  observer  and  no  account  was  made  for  absorp¬ 
tion  or  scattering  of  the  radiation  as  it  propagated  from  the 

source  to  the  receiver.  Since  the  emission  from  at  least 
47 

1.4x10  electrons  sum  together  to  yield  the  total  radiated 
emission,  we  may  use  the  central  limit  theorem  to  deduce  that 
the  spectral  components  of  the  radiation  will  be  Gaussianly 
distributed  provided  the  emissions  from  the  electrons  (or  at  . 
least  groups  of  electrons)  are  independent.  Since  a  uniform 
magnetic  field  is  necessary  to  produce  the  helical  orbits  of 
the  electrons  we  may  expect  that  this  field  could  cause  some 
dependence  of  the  emissions  from  different  electrons. 

However,  two  properties  of  the  emission  from  quasars  would 
suggest  that  the  emissions  from  the  electrons  are  not  strongly 
correlated.  Firstly,  the  emission  from  a  single  electron  is 
linearly  polarized  if  the  electron  is  moving  in  a  plane 


circular  orbit,  but  emission  from  most  quasars  shows  only  a 
small  amount  of  linear  polarization  (see,  e.g..  Haves,  1975; 
Altschuler  and  Wardle,  1977;  Conway  et  al . ,  1977; 
Simard-Normandin  et  al . ,  1981;  Simard-Normandin  et  al . ,1982) 
indicating  that  the  magnetic  fields  in  the  emission  region  are 
not  uniform  or  that  the  radiation  has  been  depolarized  as  it 
propagated  from  the  source  to  the  earth,  e.g.  by  scattering. 
Secondly,  if  all  the  emissions  were  strongly  correlated  then 
it  would  be  highly  unlikely  that  the  earth  would  fall  within 
the  beam  of  the  emission  and  we  would  probably  not  be  able  to 
see  any  quasars  unless  the  actual  number  of  quasars  were  very 
large . 

From  the  simplified  consideration  of  synchrotron  radi¬ 
ation  we  have  learned  two  important  properties  of  synchrotron 
radiation:  1)  The  components  of  the  the  Fourier  transform  of 

the  signals  from  the  quasar  (i.e.,  components  of  the  spectrum) 
are  Gaussianly  distributed  and  2)  the  power  spectrum  of  the 
emission  from  quasars  is  effectively  continuous  and  extends  to 
high  frequency.  (While  the  radiated  power  from  individual 
electrons  occurs  at  discrete  frequencies,  the  power  spectrum 
of  the  sum  of  the  emissions  from  many  electrons  with  different 
energies  will  fill  nearly  all  of  the  frequencies  in  the  power 
spectrum . ) 

The  signals  emitted  from  the  radio  sources  are  not 
directly  used  to  infer  the  values  of  the  VLBI  observables. 

The  recordings  of  these  signals  are  used.  The  signals  from  a 
quasar  will  propagate  along  separate  paths  to  each  radio 
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telescope  observing  the  source.  The  propagation  medium  along 
each  of  these  paths  will,  in  general,  have  different  proper¬ 
ties.  For  each  path  there  will  be  possibly  different  propaga¬ 
tion  velocities  and  different  absorption  characteristics,  i.e. 
there  will  be  losses  from  the  high  and  low  frequency  compon¬ 
ents  of  the  spectra  of  the  signals  by  the  time  the  signals 
reach  the  radio  telescopes. 

The  signals  will,  however,  undergo  most  changes  as  they 
propagate  through  the  receiving  electronics  at  the  radio 
telescopes.  In  Figure  2.1.2  we  show  a  schematic  diagram  of  a 
VLB  I  radio  receiver  and  recorder  equipment.  Along  with  the 
diagram  of  the  receiver  we  also  show  schematically  the  power 
spectrum  of  the  signal  at  a  number  of  locations  in  the 
receiver. 

The  first  major  change  to  the  signal's  spectrum  occurs  as 
the  signal  propagates  through  the  feed  horns  which  are  design¬ 
ed  to  receive  a  relatively  narrow  range  of  frequencies  (usual¬ 
ly  several  hundred  MHz).  For  Mark  III  geodetic  VLBI  experi¬ 
ments  there  are  generally  two  feed  horns,  one  for  X-band  (“8  . 
GHz)  and  another  for  S-band  (  =  2  GHz) .  There  will  be  losses  in 
the  power  of  the  spectral  components  with  frequencies  falling 
outside  the  operating  range  of  the  feed  horns.  Also  there 
will  be  dispersive  propagation  delays  introduced  to  the  signal 
(dispersive  is  used  here  to  indicate  that  the  derivative  of 
the  phase  with  respect  to  frequency  is  not  independent  of 
frequency).  We  will  investigate  the  dispersive  delays  through 
the  feed  horns  in  Section  3.1. 


RF  Amplifier  IF  Amplified 

(8180-8600  Mhz)  (100-550  Mhk) 


Figure  2.1.2  Schematic  diagram  of  a  radio  telescope  receiver  for  VLBI 
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After  passing  through  the  feed  horn  the  signals  propagate 
along  a  waveguide  towards  the  first  amplifier.  Shortly  after 
entering  the  waveguide  a  companion  signal  is  added  to  the 
signal  received  by  the  antenna.  This  companion  signal  is  a 
series  of  low  power  pulses,  each  of  less  than  50  psec  duration 
and  spaced  at  1  Msec  intervals.  These  pulses  allow  the  delays 
through  the  receiver  system  to  be  calibrated.  Their  operation 
will  be  discussed  in  detail  in  Section  2.3. 

The  signal  and  the  phase  calibration  pulses  undergo 
extensive  modification  as  they  propagate  through  the  receiver. 
Shortly  after  the  injection  of  the  phase  calibration  pulses 
the  signals  are  amplified.  The  amplified  signals  are  high 
pass  filtered  and  then  multiplied  by  a  high  frequency  signal 
(8.08  GHz  for  X-band  and  2.02  GHz  for  S-band) .  These 
frequencies  are  referred  to  as  the  first  local  oscillator  (LO) 
frequencies.  The  high  pass  filter  is  such  that  all  signals 
with  frequencies  less  than  that  of  the  first  LO  are  removed. 
This  multiplication  or  heterodyning  of  the  signal  will  shift 
its  spectrum.  (Heterodyning  is  also  often  referred  to  as 
mixing.)  If  we  denote  the  original  spectrum  by  3(w)  then  the 
heterodyned  signal  will  have  a  spectrum  given  by  (Bracewell, 
1978,  p.108  -  modulation  theorem) 

S  (u>)  =  S(u>+w  )  e+^r  +  §(u>-u>  )  e~^r 

m  r  r 

where  Wr  and  4^  are  the  frequency  and  phase  of  the  local- 
oscillator  signal,  and  §m(w)  is  the  spectrum  of  the  signal 
after  mixing.  We  have  dropped  the  factors  of  one  half  which 


would  normally  appear  in  this  equation  because  of  the  ampli¬ 
fication  of  the  signal  which  accompanies  the  mixing  operation. 

This  signal  is  band  pass  filtered  to  remove  the  frequency 
components  near  16  GHz  for  X-band  and  near  4  GHz  for  S-band. 
This  intermediate  frequency  (IF)  signal  is  now  amplified  and 
sent  via  coaxial  cable  to  the  IF  distributor.  The  signals  we 
wish  to  observe  are  now  in  the  frequency  ranges  of  100-550  MHz 
for  X-band  and  200-550  MHz  for  S-band  (see  Figure  2.1.2). 

The  IF  distributor,  as  its  name  implies,  separates  the  IF 
signals  into  28  separate  channels  which  are  input  to  28  video 
converters.  These  video  converters  will  mix  a  frequency 
(different  for  each  video  converter)  between  100-550  MHz  with 
the  IF  signal.  This  operation  will  shift  the  spectrum  of  the 
signal  into  the  range  0-2  MHz.  Each  video  converter  will 
shift  a  different  part  of  the  spectrum  into  this  range.  The 
mixers  in  the  video  converters  are  single  sideband  mixers  and 
will  output  two  channels,  one  for  the  upper  sideband  and  the 
other  for  the  lower  sideband.  The  output  from  either  or  both 
channels  may  be  recorded  on  magnetic  tape.  (If  both  upper  and 
lower  sideband  signals  are  recorded,  they  are  recorded  on 
separate  tracks  on  the  tape.  When  signals  from  both  sidebands 
are  recorded  the  total  number  of  widely  spaced  frequency 
channels  which  can  be  sampled  is  reduced  by  one  half  from  the 
number  which  could  be  sampled  if  only  one  (single)  sideband 
were  recorded.  The  advantages  of  recording  both  sidebands 
will  be  discussed  in  Appendix  B.) 

In  Figure  2.1.3  the  effects  of  the  mixing  of  the  signal 
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are  shown  graphically.  (While  examining  this  figure  it  is 
important  to  remember  that,  in  general,  spectra  contain  both 
positive  and  negative  frequencies.  For  real  signals  (complex 
sense) ,  the  negative  frequencies  must  be  present  to  ensure 
that  the  imaginary  parts  of  the  Fourier  integral  will  be 
zero.)  From  the  figure  it  can  be  seen  that  components  of  the 
spectrum  from  above  and  below  the  LO  frequency  will  add 
together  when  the  signal  is  mixed  with  only  a  cosine  signal. 
(This  mode  of  mixing  is  referred  to  as  double  sideband 
mixing.)  To  avoid  this  overlap  of  upper  and  lower  sidebands  a 
single  sideband  mixer  is  used.  This  overlap  of  sidebands  does 
not  occur  in  the  first  mixing  operation  (which  only  mixed  a 
cosine  signal)  because  the  signals  with  frequencies  less  than 
that  of  the  first  LO  are  removed  (by  filtering)  before  mixing. 
With  a  single  sideband  mixer,  the  signal  is  split  into  two 
channels,  and  a  sine  is  mixed  with  one  and  a  cosine  wave  with 
the  other.  The  sine  channel  (i.e.,  the  channel  mixed  with  the 
sine  wave) ,  after  mixing,  is  rotated  by  -90°  of  phase  and 
added  to  the  cosine  channel.  Their  sum  will  contain  only  the. 
frequency  components  from  above  the  LO  frequency  (Rogers, 

1971) .  This  sum  is  referred  to  as  upper  sideband  (USB) .  If 
the  90°  rotation  is  applied  to  the  cosine  channel,  the  sum 
will  contain  only  frequency  components  from  below  the  LO 
frequency.  This  sum  is  the  lower  sideband  (LSB) . 

When  single  sideband  mixers  are  used  in  the  video  con¬ 
verters,  the  output  signal  of  the  mixers  for  upper  sideband 
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will  be  related  to  the  input  IF  signal  by 

n  i  Au 

Su(wy)  =  Sm(wv+  u>x)  e  1VI  ;  if  uv>  0 

and 

-u  -  -i*u 

Su(u>v)  =  Sm(wv-  Wj)  e  l*I  ;  if  u>v<  0, 

where  Su  ( cov )  is  the  spectrum  at  video  frequencies  (i.e,  the 
video  spectrum)  ,  and  coj  and  are  the  frequency  and  phase  of 
the  local  oscillator  in  the  video  converter.  We  may  combine 
the  effects  of  the  two  local  oscillators  into  a  single  expres¬ 
sion.  For  upper  sideband  the  relationship  between  the  RF 
spectrum  and  the  video  spectrum  will  be 


Su  (w  ) 

V 

=  s(«v+ 

V 

i 

if  o>v>  0 

(2. 1.1. a) 

and 

suuv) 

-  S(»v- 

) 

e-^l 

• 
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if  uv<  0, 

(2.1. l.b) 

where  =  o> 

r+wI  and 

♦; 

For 

the  lower  sideband,  the 
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RF  spectrum 

and  the  video 

spectrum 
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• 

9 

if  ajy<  0, 

(2 . 1. 2.b) 

0  0 

where  ^  =  $r+4>j.  is  the  lower  sideband  phase  shift.  Ideally 
f  u 

4>^  will  equal  4>^  but  this  equality  may  not  always  hold  (see 
Appendix  B) . 

Before  the  video  signals  are  recorded,  they  are  hard- 
limited  or  clipped.  The  clipping  is  achieved  by  amplifying 
the  signal  to  the  saturation  point  of  the  amplifier,  i.e.  the 
output  of  the  amplifier  is  plus  or  minus  its  saturation 


’  *  «  *  *  * 


voltage.  This  amplified  signal  is  then  uniformly  sampled  at 
the  Nyquist  rate  for  a  2  MHz  bandwidth  signal.  The  output  of 
the  clipper  is  either  plus  or  minus  one  depending  on  whether 
or  not  the  input  voltage  was  positive  or  negative.  In  this 
mode  a  single  bit  on  the  tape  can  be  used  to  represent  the 
data  sample. 

Up  until  the  clipping  of  the  signal  the  operation  of  the 
receiver  has  only  rotated,  shifted  in  frequency,  and  added 
noise  spectra  to  the  spectrum  of  the  signal  from  the  radio 
source.  The  clipping  operation  will  however  change  the  shape 
of  the  spectrum  of  the  signal  and  noise.  These  changes  in  the 
spectra  will  change  the  correlation  function  of  the  signals. 
The  changes  in  the  correlation  function  of  a  signal  which  has 
been  clipped  have  been  previously  studied  (Van  Vleck  and 
Middleton,  1966;  Whitney,  1974).  For  a  stationary  process, 
the  autocorrelation  function,  Ac(x),  of  a  clipped  signal  will 
be  given  by 

Ac(t)  =  (2/n)  sin-1  A  ( x )  , 

where  A(x)  is  the  autocorrelation  function  of  the  original 
signal,  and  i  is  the  lag  introduced  for  correlation. 

The  signal  plus  noise  which  is  recorded  on  magnetic  tape 
is  bandlimited  to  the  frequency  range  of  0-2  MHz  (the  actual 
recorded  signals  do  not  cut  off  instantaneously  at  2  MHz  nor 
do  they  have  a  zero  frequency  component;  this  DC  component  is 
filtered  out  before  recording).  The  spectral  components  of 
the  signals  in  this  0-2  MHz  range  are  the  same  components  that 
occurred  in  the  spectrum  of  the  signals  from  the  radio  source 
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in  the  range  0-2  MHz  around  their  radio  frequency  (RF) ,  (they 
have  however  been  amplified)  with  added  noise  which  also  came 
predominantly  came  from  0-2  MHz  band  around  the  RF.  The  major 
noise  source  is  the  first  amplifier  in  the  receiver  chain. 

Over  the  2  MHz  range  at  RF  the  signal  and  noise  power  have 
constant  amplitude  and  hence  the  video  signals  will  also  have 
a  constant  power  spectrum.  A  signal  whose  power  spectrum  is 
constant  in  the  range  0-2  MHz  will  have  an  autocorrelation 
which  is  given  by  s  i  n  { 2itBx)  /  (2,hBt)  ,  where  B  is  the  bandwidth 
(2  MHz  in  our  case) .  The  autocorrelation  function  of  the 
clipped  signal  will  therefore  be 

Ac(t)  =  (2/tt)  sin-1{sin(2nBx) /(2tiBt)  } 

If  the  clipped  signal  is  sampled  at  the  Nyquist  rate  (1/2B 
time  interval  between  samples),  we  see  that  Ac(n/2B)  is  zero 
for  all  integral  £  and  hence  that  the  samples  of  the  clipped 
signal  will  be  uncorrelated. 

2.2  Cross  spectrum  phase  and  its  relationship  to  group  delay, 
phase  delay  and  phase  delay  rate. 

In  Section  2.1  we  examined  the  signals  which  are  recorded 
on  magnetic  tape  at  each  radio  telescope  during  a  VLBI  obser¬ 
vation.  In  this  section  we  will  investigate  the  method  used 
to  extract  the  values  of  the  VLBI  observables  from  these 
recordings.  The  recorded  data  consist  of  three  components: 
the  signals  from  the  radio  source  (which  will  be  present  on 
the  recordings  at  each  site);  the  noise,  mainly  from  the 


receivers  at  each  site;  and  the  pulses  generated  by  the  phase 
calibration  system.  The  statistical  properties  of  the  record¬ 
ed  data  were  discussed  in  Section  2.1.  The  spectral  compo¬ 
nents  of  both  the  radio  source  signal  and  the  receiver  noise 
should  be  Gaussianly  distributed.  For  the  radio  sources 
observed  in  geodetic  VLBI  experiments,  the  power  spectrum  of 
the  signals  over  the  2  MHz  recorded  bandwidth  will  be  constant 
(except  at  the  edges  of  the  band  where  it  rapidly,  but  not 
instantaneously,  decreases) .  The  power  spectrum  of  the  noise 
should  also  be  constant  over  the  2  MHz  recorded  bandwidth. 

If  we  were  to  cross  correlate  the  tape  recordings  from 
each  site,  we  would  expect  that  the  signals  from  the  radio 
source  would  correlate  and  cause  a  peak  in  the  correlation 
function  at  a  lag  between  the  tapes  corresponding  to  the 
difference  in  the  arrival  times  of  the  signals  (provided  the 
integrated  signal  strength  is  sufficently  greater  than  the 
noise  that  this  peak  can  be  distinguished  from  the  noise) . 

(For  the  moment  we  will  neglect  the  additional  complication 
that  the  Earth  is  rotating  and  hence  that  the  delay  is  not 
constant.)  This  attribute  could  be  used  to  estimate  the 
difference  in  arrival  times  of  the  signals  from  the  radio 
source.  An  alternative  technique  for  estimating  the  delay 
would  be  to  find  the  cross  spectrum  of  the  data  recorded  at 
each  site  and  use  the  phase  of  the  cross  spectrum  to  estimate 
the  delay:  the  phase  of  the  cross  spectrum  divided  by  the 
angular  frequency  will  be  the  delay. 

There  would  appear  to  be  a  difference  in  information 
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content  between  these  two  estimates.  If  all  the  data  are 
cross  correlated,  then  only  a  single  estimate  of  the  delay  is 
obtained.  On  the  other  hand,  if  the  cross  spectrum  approach 
is  used,  then  an  estimate  of  the  delay  is  obtained  for  each 
frequency  at  which  the  cross  spectrum  is  computed.  The  reason 
for  this  apparent  difference  in  information  content  is  that  we 
have  been  discussing  two  different  types  of  delay.  When  the 
data  are  cross  correlated  and  the  delay  determined  by  maxi¬ 
mizing  the  cross  correlation  function  we  are  measuring  the 
difference  in  arrival  time  of  packages  of  energy  from  the 
source  (see,  e.g.,  Jackson,  1975,  pp  209-303).  These  packages 
of  energy  will  propagate  at  a  group  velocity  and  the  measured 
difference  in  arrival  times  will  be  a  difference  in  group 
delays.  When  the  cross  spectrum  approach  is  used,  we  are 
decomposing  the  energy  packets  into  their  individual 
"monochromatic"  waves.  These  waves  will  propagate  at  their 
respective  phase  velocities.  At  each  frequency  the  phase  of 
the  cross  spectrum  is  a  measure  of  the  phase  delay  at  that 
frequency.  However,  the  phase  can  only  be  measured  modulo  2n. 
radians  and  hence  from  the  cross  spectrum  we  do  not  obtain 
unique  estimates  of  the  phase  delays. 

The  apparent  difference  in  information  content  of  the  two 
techniques  can  be  resolved  by  noting  that  the  group  delay  is 
the  derivative  of  the  phase  with  respect  to  angular  frequency. 
Hence,  the  group  delay  which  is  obtained  from  cross  correl¬ 
ating  data,  can  be  obtained  from  the  cross  spectrum  results  by 
determining  the  derivative  of  the  phase  of  the  cross  spectrum 


with  respect  to  angular  frequency.  This  derivative  will  be 
the  group  delay. 

In  practice  the  use  of  either  of  the  techniques  discussed 
above  is  complicated  by  the  rotation  of  the  earth  and  hence  by 
the  delay  changing  with  time.  To  overcome  this  problem  we 
could  cross  correlate  the  data  from  two  tapes  while  running 
the  tapes  at  different  speeds  however  with  digital  recordings 
this  procedure  would  be  very  difficult  to  implement.  With  the 
cross  spectrum  technique,  the  spectra  of  the  data  recorded  at 
each  site  will  be  Doppler  shifted  if  the  stations  are  moving 
with  respect  to  the  source  and  hence  the  phase  of  the  cross 
spectrum  computed  by  Fourier  transforming  the  data  changes  its 
meaning.  If  the  Doppler  shifts  of  the  signals  are  known  then 
the  spectra  of  the  data  from  each  site  can  be  corrected  and  a 
more  useful  cross  spectrum  constructed. 

A  method  for  overcoming  the  problems  associated  with  the 
rotation  of  the  earth  and  for  determining  the  maximum  likeli¬ 
hood  estimates  of  the  group  delay,  phase  delay  and  phase  delay 
rate  when  the  delay  is  a  function  of  time  was  developed  by 
Rogers  (1970) .  Rogers  constructed  a  function,  called  the 
delay  resolution  function,  which  depended  upon  the  cross 
spectral  components  of  the  recorded  data  determined  from  short 
segments  of  data  and  trial  values  of  the  phase  delay,  group 
delay,  and  the  phase  delay  rate.  The  values  of  these  latter 
two  quanities  which  maximize  the  value  of  this  function  can  be 
shown  to  be  the  maximum  likelihood  estimates  of  the  VLBI 
observables  (see  Appendix  A). 
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To  overcome  the  problem  associated  with  the  rotation  of 
the  earth,  we  can  segment  the  recorded  data  from  each  channel 
into  short  intervals  (e.g.,  intervals  of  4  ^sec  duration)  and 
compute  the  cross  spectra  of  these  4  nsec  segments  of  data 
(sixteen  data  samples  from  each  channel  are  used  to  determine 
the  cross  spectrum  for  that  channel).  Only  8  complex  compo¬ 
nents  of  the  cross  spectrum  are  determined  from  the  4  nsec 
span  of  data  (i.e.,  the  cross  spectrum  is  computed  at  eight 
equally  spaced  frequencies  spanning  the  2  MHz  recorded  band¬ 
width)  .  Over  such  a  4  nsec  interval,  the  change  in  phase  of 
the  cross  spectrum  due  to  the  rotation  of  the  earth  will  be 
small,  and  the  computed  cross- spectral  components  will  approx¬ 
imate  the  cross  spectrum  which  would  have  been  obtained  had 
the  two  sites  been  moving  with  equal  velocity  (see  Appendix  A 
for  details) .  The  cross-spectral  components  from  all  of  the  4 
nsec  data  segments,  from  all  of  the  channels,  could  then  be 
used  in  the  delay  resolution  function  to  estimate  the  values 
of  the  VLB  I  observables.  (This  technique  of  using  4  jisec 
segments  of  data  is  not  the  actual  technique  used  by  the  Mark. 
Ill  correlator,  but  it  will  provide  us  with  a  method  for  study 
of  the  actual  techniques  used;  see  Appendix  B  for  more 
details.) 

In  Appendix  A  we  review  the  development  of  the  delay 
resolution  function  and  show  that  when  the  statistical  proper¬ 
ties  of  the  signal  and  noise  are  independent  of  time  and 
frequency,  the  maximization  of  the  delay  resolution  function 
is  the  maximum  likelihood  estimator  of  the  VLBI  observables 
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and  is  equivalent  to  a  least  squares  estimator.  The  data 
which  would  be  used  in  the  least  squares  estimator  are  the 
phases  of  the  cross-spectral  components  computed  at  7  frequen¬ 
cies  from  each  4  nsec  data  segment  or  from  coherent  averages 
of  these  cross-spectral  phases  from  many  4  nsec  data  segments 
(see  Equation  (A. 16)). 

The  delay  resolution  function  D(Xg»Xp)  *s  given  by 
Equation  (A. 10)  (see  Appendix  A  for  details): 


BT 
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where  are  the  components  of  the  cross  spectrum,  at 

frequency  w ^ ,  of  the  data  recorded  at  each  site  at  time  At^ 
relative  to  the  center  of  the  observation  (At^  refers  to  the 
epoch  of  the  center  of  the  short  segment  of  data  used  to 
determine  the  cross  spectrum) ,  xg  and  Xp  are  trial  values  of 
the  group  delay  and  phase  delay  rate  which  are  varied  until 
the  magnitude  of  D(Tg,tp)  is  maximized.  The  phase  of 

A  A  w 

D(?  ,t  ),  where  $  and  *  are  the  values  of  *  and  t  which 
y  f  y  r  y  p 

maximize  the  magnitude  of  the  delay  resolution  function,  is 
the  estimate  of  the  visibility  phase  referred  to  the  frequency 
uQ.  (The  visiblity  phase  is  an  estimate  of  the  cross-spectral 
phase  at  ;  see  Appendix  A  for  more  detailed  discussion.) 

The  input  for  the  delay  resolution  is  not  the  data 
recorded  during  the  observation,  but  rather  the  components  of 
the  cross  spectrum  of  the  recorded  data.  We  have  discussed 
one  method  for  obtaining  these  components:  by  cross  correlat¬ 
ing  short  segments  of  data  and  finding  the  Fourier  transform 


of  the  cross  correlation  functions  from  each  data  segment.  In 
Appendix  B  we  investigate  the  methods  used  by  the  Mark  III 
correlator  to  generate  the  cross-spectral  components  from  the 
recorded  heterodyned  signals. 

The  actual  implementation  of  maximizing  the  delay  resolu¬ 
tion  function  is  not  as  straightforward  as  the  above  equation 
(Equation  A. 10)  implies.  (The  material  discussed  here  is 
treated  extensively  in  Appendix  B;  in  the  following  discussion 
we  will  try  to  highlight  the  major  points  from  this  appendix.) 
The  summation  given  in  Equation  (A. 10)  is  two  summations,  one 
over  time  and  the  other  over  frequency.  In  the  implementation 
of  Equation  (A. 10)  these  summations  are  treated  separately. 

The  cross-spectral  components  within  each  frequency  channel 
are  coherently  averaged  over  a  period  of  time  referred  to  as 
an  accumulation  period  (usually  -2  sec) .  The  coherent  average 
corresponds  to  initially  carrying  out  the  time  summation  in 
sections,  each  of  a  sufficiently  short  duration  that  the  a 
priori  phase-delay  rate  can  be  used  in  the  summation  rather 
than  the  trial  values  of  the  phase-delay  rate.  The  trial 
values  of  the  phase-delay  rate  are  used  to  sum  the  coherent 
averages.  The  frequency  summation  is  also  separated  into  two 
parts  (but  for  reasons  different  from  those  for  separating  the 
time  summation) .  The  frequencies  of  the  cross-spectral  compo¬ 
nents  are  clustered  in  groups  in  each  2  MHz  bandwidth  channel, 
with  the  groups  separated  from  one  another  by  several  tens  to 
several  hundreds  of  megaHertz.  The  group  delay  will  be  more 
precisely  estimated  the  using  the  cross-spectral  phases  from 


widely  spaced  frequencies.  Hence,  two  group  delays  are 
estimated  --  a  "multiband"  group  delay,  which  is  estimated 
from  the  slope  of  the  "video  DC"  phases  from  the  widely  spaced 
2  MHz  channels  and  a  "single  band"  group  delay  which  is  esti¬ 
mated  from  the  average  of  the  slopes  of  the  cross-spectral 
phases  for  the  individual  2  MHz  channels.  (The  "video  DC" 
phases  are  the  estimates  of  the  cross  spectral  phase  at  u>v=0 
(video  DC) ,  which  would  be  obtained  from  least-squares 
estimates  of  a  slope  and  an  intercept  (at  1^=0)  ,  from  the 
cross-spectral  phases  at  14  or  7  discrete  video  frequencies 
(depending  upon  whether  dual  or  single-sideband  data  were 
recorded,  respectively.)  The  slope  is  the  singleband  delay, 
and  the  intercept  is  the  video  DC  phase;  see  Section  B.2,  for 
more  details.)  The  singleband  group  delay  estimate  is  much 
less  precise  then  the  multiband  group  delay  estimate. 

Thus,  there  are  three  quanities  for  which  the  delay 
resolution  function  is  maximized:  the  multiband  group,  the 
singleband  group  delay  and  the  phase  delay  rate.  The  estimate 
of  the  visibility  phase  is  the  phase  of  the  delay  resolution  . 
function  after  the  estimates  of  the  above  three  quantities  are 
substiuted . 

The  estimate  of  the  multiband  group  delay  is,  in  general, 
not  unique.  The  frequency  spacings  of  the  widely  separated 
channels  are  chosen  to  be  multiples  of  the  minimum  frequency 
spacing  (this  type  of  frequency  spacing  ensures  low  "sidelobe" 
levels  in  the  delay  resolution  function)  and  hence  it  is 
possible  to  find  equally  good  representations  of  the  observed 
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(video  DC)  phases  by  shifting  the  group  delay  by  sec, 

where  Atomjn  is  the  minimum  (angular)  frequency  spacing.  This 
change  in  the  group  delay,  called  the  group-delay  ambiguity, 
will  change  the  estimates  of  the  video  DC  phases  of  all  of  the 
(widely  spaced)  frequency  channels,  by  exact  multiples  of  2* 
radians.  Hence,  the  difference  between  the  observed  video  DC 
phases  and  the  estimates  of  their  values  calculated  from  the 
group  delay,  visibility  phase  and  phase-delay  rate  will  be 
unaffected.  (Remember  that  we  have  no  way,  other  than  from  a 
priori  information,  of  knowing  which  multiple  of  2n  radians 
the  phases  in  each  channel  should  be  located  on;  see  Appendix 
A  for  more  details.) 

The  signals  received  at  each  site  may  be  recorded  in  one 
of  two  ways.  In  geodetic  Mark  III  experiments,  before  June  24, 
1981,  the  signals  were  recorded  with  dual  sidebands,  i.e., 
both  the  upper  and  the  lower  sidebands  were  recorded.  This 
recording  technique  offers  the  advantage  that  some  of  the 
systematic  errors  which  are  introduced  by  the  approximations 
made  in  the  correlator  algorithms,  cancel  with  this  mode  of 
recording  (see  Appendix  B  for  details).  The  disadvantage  of 
this  recording  mode  is  that  each  channel  requires  a  pair  of 
tape  tracks,  limiting  the  maximum  number  of  frequency  channels 
which  can  be  recorded  in  Mode  B  to  seven.  (Mode  B  is  a 
recording  mode  for  which  14  tracks  on  the  tape  are  recorded 
during  the  forward  motion  of  the  tape  and  the  other  fourteen 
are  recorded  in  the  reverse  motion  of  the  tape  (Rogers  et  al . , 
1983)).  When  only  single  sideband  data  are  recorded  (i.e.. 


when  only  one  output  from  each  video  converter  is  sent  to  the 
tape  recorder) ,  14  frequency  channels  can  be  recorded  (in  Mode 
B) .  These  additional  frequency  channels  can  be  used  to  reduce 
the  sidelobe  levels  in  the  delay  resolution  function  (see 
Whitney,  1974,  and  Robertson  and  Cater,  1983,  for  diagrams  of 
the  delay  resolution  function) . 

When  the  algorithms  which  are  used  to  generate  the  cross- 
spectral  components  were  derived  in  Appendix  B,  several 
approximations  were  made.  The  effects  of  these  approximations 
are  also  studied  in  this  appendix.  We  found  that  when  the 
phase  delay  rate  is  greater  than  10  sec/sec,  the  biases 
introduced  into  the  visibility  phase  should  be  less  than  10°, 
when  only  single  sideband  data  are  used.  (The  phase-delay 
rate  will  exceed  10-9  sec/sec  most  of  the  time  on  baselines 
greater  than  10  km  long  if  uQ  >2.3  GHz.)  When  upper  and  lower 
sideband  data  are  used,  in  equal  proportions,  the  visibility 
phase  errors  should  be  zero.  Only  one  approximation  (accumu¬ 
lating  the  cross  correlation  function  with  an  erroneous 
phase-delay  rate)  was  found  which  could  affect  the  multiband 
group  delay.  Even  in  the  extreme  situation  studied  the  error 
in  the  multiband  delay  was  only  20  psec  (a0.7  cm)  at  X-band. 

In  general,  we  would  expect  this  approximation  to  introduce 
errors  in  the  multiband  group  delay  of  less  than  one  pico¬ 
second  (*0.3  mm)  . 

There  are  of  course  many  more  components  in  the  Mark  III 
system  which  could  affect  the  VLBI  measurements.  It  is  beyond 
the  scope  of  this  thesis  to  investigate  all  of  the  possible 
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interactions  between  these  components  and  the  VLBI  measure¬ 
ments.  In  Chapter  3  where  we  investigate  the  quality  of  the 
VLBI  measurements,  we  will  discuss  some  of  those  errors  due  to 
equipment  limitations  rather  than  to  the  approximations  made 
in  applying  the  estimation  procedures. 

Before  finishing  our  discussion  of  the  VLBI  observables 
we  need  to  discuss  one  final  aspect  of  the  measurement  tech¬ 
nique  --  the  phase-calibration  system. 


2.3  The  phase-cal ibration  system 


In  Section  2.1  we  discussed  the  system  components  which 
operate  on  the  signal  from  the  radio  source  as  the  signal 
propagates  through  the  receiver  to  the  tape  recording.  These 
components  (waveguides,  mixers,  amplifiers,  cables,  filters) 
will  greatly  affect  the  phase  of  the  signal,  e.g.,  1.5  m  of 
waveguide,  with  a  cutoff  wavelength  of  5.7  cm,  will  have  a 
phase  delay  of  3.9  nsec  at  8.3  GHz  which  corresponds  to  32.4 
cycles  (see  Section  3.1).  If  the  propagation  properties  of 
the  receiver  were  time  independent,  then  these  delays  through 
the  receiver  would  pose  no  serious  problem.  (The  propagation 
delays  would  be  constant  and  could  not  be  differentiated  from 
the  epoch  offset  of  the  hydrogen  maser.)  Unfortunately,  the 
properties  of  the  components  in  the  receiver  are  not  time 
independent.  There  are  two  different  types  of  phase  vari¬ 
ation  which  can  occur  in  the  receiver.  These  variations  are 
changes  in  the  local  oscillator  phases  which  will  add  direct- 


ly  to  the  signal  phases  (see  Equations  (2.1.1)  and  (2.1.2)) , 
and  variations  in  the  propagation  delays.  Both  of  these 
types  of  change  can  be  monitored  by  the  phase-calibration 
system. 

The  phase-calibration  system  consists  of  pulses  which 
are  injected  into  the  receiver  every  microsecond  (see  Figure 
2.1.2).  In  the  frequency  domain  the  pulses  will  form  a 
"comb"  of  Dirac  delta  function  "rails"  with  1  MHz  spacings 
between  the  rails.  The  pulses  are  of  finite  duration  (-50 
psec)  and  hence  the  comb  will  extend  to  approximately  the 
inverse  of  the  pulse  duration  («20  GHz)  ,  (Rogers,  1980). 

In  the  frequency  domain,  there  will  be  two  phase-cali¬ 
bration  rails  appearing  in  each  2  MHz  channel.  The  frequency 
boundaries  of  the  channels  are  set  such  that  these  rails  will 
appear  at  10  kHz  and  at  1010  kHz  for  upper  sideband  (USB) 
channels.  The  rails  will  appear  at  990  kHz  and  1990  kHz  in 
the  lower  sideband  (LSB)  channel  when  both  sidebands  are 
being  recored.  If  only  LSB  is  being  recorded,  the 
calibration  rails  would  be  located  at  10  kHz  and  1010  kHz. 

The  frequency  boundaries  of  each  channel  are  determined  by 
the  sum  of  the  local-oscillator  frequencies  which  are  mixed 
with  signals  from  the  radio  source,  and  the  recorded 
bandwidth . 

The  phase-calibration  pulses  will  propagate  through  the 
receiver  and  their  spectral  components  will  undergo  the  same 
phase  shifts  as  the  spectral  components  of  the  signals  from 
the  radio  source  at  the  corresponding  frequencies.  When  the 


phase-calibration  phases  are  subracted  from  the  video  DC 
phases  (see  Section  2.2),  the  calibrated  phases  become  refer¬ 
red  to  the  phase  of  the  pulses  at  injection  into  the  pre¬ 
amplifier  of  the  front  end. 

The  phase  of  only  one  of  the  calibration  rails  is 
extracted  during  correlation  (the  rail  occurring  near  zero 
frequency  (DC)  in  the  video  spectrum,  i.e.  usually  10  kHz). 
The  major  reason  for  extracting  a  single  calibration  phase 
concerns  the  use  of  dual  sideband  recordings.  We  discussed 
in  Section  2.2,  the  two  types  of  group  delay  which  are  esti¬ 
mated  during  processing  of  VLBI  data,  the  multiband  and 
single  band  group  delays.  The  extraction  of  a  second  cali¬ 
bration  phase  would  predominantly  affect  the  estimation  of 
the  singleband  group  delay  since  the  second  calibration  phase 
would  calibrate  phase  changes  over  the  2  MHz  recorded  band¬ 
width.  When  the  Mark  III  system  was  being  designed  the 
recording  made  was  expected  to  be  both  upper  and  lower  side¬ 
band  so  that  each  channel  in  the  synthesized  band  would  have 
effectively  a  4  MHz  bandwidth  with  the  reference  frequency 
(i.e.,  the  radio  frequency  corresponding  to  “v=0) ,  for  the 
channel  in  the  center  of  the  4  MHz  bandwidth.  Because  the 
multiband  group  delay  (this  is  the  delay  used  in  geodetic 
data  processing),  is  estimated  from  the  phases  at  the  refer¬ 
ence  frequency,  any  error  in  the  derivative  of  the  phase  with 
respect  to  frequency  across  the  4  MHz  (USB  +  LSB)  channel 
will  not  affect  these  phases  (because  of  symmetry)  and  hence 
will  not  affect  the  estimates  of  the  multiband  group  delay 


(see  Appendix  B  for  more  details) .  Consequently  there  was  no 

need  to  determine  the  phase  of  the  second  calibration  rail. 

However,  in  recent  Mark  III  VLBI  experiments  (since  June  24, 

1981)  only  single  sideband  recordings  have  been  used.  There 

were  two  reasons  for  the  change  in  recording  mode.  Firstly, 

the  assumption  of  the  insensitivity  of  the  phase  at  the 

reference  frequency  to  the  derivative  of  the  phase  with 

respect  to  frequency  across  the  4  MHz  bandwidth  is  only  valid 

if  the  same  weight  is  given  to  the  data  from  each  sideband. 

This  (weight)  equality  was  often  not  satisfied  because  of 

tape  reading  errors;  hence  many  of  the  advantages  of  using 

both  sidebands  were  compromised.  Secondly,  the  tape  density 

upgrade  (Hinteregger ,  1980;  Hinteregger,  1982)  was  taking 

longer  than  expected  and  there  was  a  desire  for  more 

frequency  channels  in  the  synthesized  bandwidth.  (The 

additional  frequency  channels  were  needed  to  allow  a  wider 

bandwidth  to  be  spanned  while  having  a  smaller  minimum 

frequency  spacing  (this  smaller  spacing  increases  the  group- 

delay  ambiguity;  see  Appendix  A) .  The  additional  frequencies 

also  made  the  group  delays  much  less  sensitive  to  phase 

errors  in  individual  channels  (see  Section  3.1).  With  single 

sideband  recordings,  the  extraction  of  a  second  calibration 

phase  would  reduce  errors  due  to  changes  in  the  system  delay 

(over  the  2  MHz  recorded  bandwidth) .  A  1  nsec  change  in  the 

delay  through  the  system  will  introduce  an  error  in  the  phase 

o  .  ,  , 

of  the  reference  frequency  of  0.36.  (This  number  is  easily 
calculated  by  noting  that  the  reference  frequency  is  1  MHz 


away  from  the  center  of  the  recorded  channel  and  hence  a 
change  in  the  delay  of  Atc  will  lead  to  a  phase  error  of 
Atcx1  MHz;  see,  also,  discussion  associated  with  Figure 
B.3.1).  In  Section  3.1  (Figure  3.1.5),  we  show  evidence 
which  indicates  that  such  changes  may  be  possible  at  some 
antennas  (although  the  phenomenon  shown  in  Figure  3.1.5  could 
be  explained  by  other  mechanisms  as  well).  At  this  time,  the 
correlator  modules  would  need  to  be  redesigned  to  allow  both 
calibration  phases  to  be  extracted.  When  the  tape  density 
upgrade  is  completed,  dual  sideband  observations  should  be 
adopted  again,  in  order  to  minimize  errors  due  to  system 
delay  changes. 

The  extraction  of  the  calibration  phases  and  amplitudes 

is  carried  out  in  the  correlator  module  before  the  data 

streams  are  cross  correlated.  Ideally,  to  extract  the  phase- 

calibration  phase  we  would  multiply  each  data  stream  by 

cosUct)  and  sin(wct),  where  uc  is  the  frequency  of  the 

calibration  rail,  and  then  low  pass  filter  the  output.  The 

calibration  phase  would  then  be  tan  1 (C  /C  ),  where  C  and  C 

sc  sc 

are  the  correlation  coefficents  from  the  sine  and  cosine 

multiplications.  The  phase  calibration  amplitude  is 

2  2 

calculated  from  /C  +C  .  This  amplitude  is  usually  expressed 

S  C 

_  4 

in  units  of  10  of  the  (on-source)  system  temperature,  and 
hence  is  a  function  of  the  power  in  the  calibration  pulses 
and  the  system  power.  The  calibration  amplitudes  are  not 
normally  used,  but  we  will  see  in  Section  3.2  that  these 
amplitudes  can  provide  a  useful  diagnostic  tool. 
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The  multiplication  by  sine  and  cosine  functions  is 
carried  out  in  the  correlator,  but  the  cosine  and  sine 
functions  actually  used  are  two-level  approximations,  i.e. 
they  are  either  plus  or  minus  one  depending  upon  the  argument 
of  the  function.  During  a  two  second  accumulation  period  the 
cosine  and  sine  will  rotate  through  20,000  cycles  and  the 
errors  introduced  by  using  the  two-level  cosine  and  sine 
function  will  cancel  to  a  very  high  degree.  (The  error  in 
the  calibration  phase  introduced  by  the  use  of  the  two-level 
sine/cosine  approximation  will  be  “5xl0~5  rad;  see  discus¬ 
sions  on  the  three-level  sine/cosine  function  in  Appendix  B.) 

The  use  of  a  two-level  cosine/sine  function  is  not 
without  its  problems  however.  The  signal- to-noise  ratio 
(SNR)  of  the  phase  determination  is  lowered  by  4/(n/2)=0.90 
(this  factor  is  the  ratio  of  the  Fourier  series  coefficient 
of  the  two-level  cosine  function  to  the  Fourier  series  coef¬ 
ficient  for  a  pure  cosine  function  divided  by  the  expected 
increase  in  noise  level,  /2) .  This  reduction  in  SNR  is  not  a 

serious  problem  because  the  effective  temperature  of  the 

4 

phase-calibration  rails  at  10  GHz  is  *  5x10  K  (Mark  III 
Documentation,  1980,  p.  DC-8)  and  after  a  1  second  integra¬ 
tion  time  the  SNR  will  be  300  if  the  system  temperature  is 

4 

167  K.  (The  5x10  K  temperature  of  the  calibration  rail 
corresponds  to  the  phase  calibration  pulse  power  being  =  1% 
of  the  total  system  power  in  a  2  MHz  bandwidth.)  With  such 
large  SNRs  a  reduction  of  10%  in  the  SNR  is  not  a  serious 
problem. 


The  second,  and  more  serious,  disadvantage  of  using  the 
two-level  sine/cosine  function  is  the  harmonics  which  are 
generated  by  this  type  of  function.  Since  the  two  level 
sine/cosine  function  is  periodic,  it  can  be  expanded  in 
Fourier  series  with  the  fundamental  frequency  being  the 
calibration  frequency  at  video,  i.e.  10  kHz.  The  Fourier 
coefficients  decay  as  1/n,  where  n  is  the  harmonic  number. 
These  harmonic  terms  should  cause  no  serious  problems  unless 
there  are  non-random  signals  at  these  frequencies.  There 
should  not  be  any  non-random  signals  at  these  frequencies, 
except  for  the  101st  harmonic  which  corresponds  to  the 
frequency  of  the  second  phase  calibration  rail  in  the  2  MHz 
bandwidth.  Approximately  1%  of  the  signal  in  this  second 
rail  will  be  included  in  the  computation  of  the  calibration 
phase.  This  leakage  of  the  second  calibration  rail  into  the 
first  calibration  rail  could  introduce  errors  of  order  0?6 
(0.01  rad)  in  the  estimated  calibration  phases.  These  errors 
should  be  kept  in  mind  as  the  precision  of  VLBI  observations 
improves.  At  this  time  there  seems  to  be  more  serious 
problems  which  need  solving  (see  Chapter  3). 

The  phase-calibration  pulses  provide  a  series  of  time 
"ticks"  at  the  injection  point,  to  which  the  signal  arrival 
times  can  be  referred,  i.e.  the  calibrated  cross-spectral 
phases  are  the  phase  differences  between  the  spectra  of  the 
signals  at  the  injection  points  of  the  pulses,  at  the  two 
sites  in  the  interferometer,  referred  to  the  phases  of  the 
spectral  components  of  the  calibration  pulses,  again  at  the 


injection  points  at  the  two  sites.  (See  discussion  in 
Appendix  B  concerning  the  singleband  delay.  Equation 
(B.3.3).)  Ideally,  we  would  like  the  signal  arrival  tines 
referred  to  the  tine  "ticks"  generated  by  the  hydrogen  naser 
frequency  standard  which  serves  as  the  naster  clock  at  each 
site.  If  the  propagation  delay  through  the  cable  which 
carries  the  5  MHz  signal  used  to  generate  the  phase-calibra¬ 
tion  pulses  renains  constant,  then  there  would  be  a  constant 
offset  between  the  hydrogen  naser  tine  ticks  and  the  tine 
ticks  provided  by  the  phase-calibration  pulses.  The  propaga¬ 
tion  delay  through  this  cable  is,  however,  a  function  of 
tenperature  and  tension  (and  possibly  other  tine-dependent 
quantities) .  To  ensure  that  this  offset  is  always  known,  the 
delay  through  the  cable  carrying  the  5  MHz  signal  is  noni- 
tored  using  a  cable-calibration  systen.  The  nethod  used  to 
neasure  the  cable  delay  relies  on  reflecting  a  nodulated 
version  of  the  5  MHz  signal  fron  the  cable/pulse  generator 
connector  back  down  the  cable.  (The  nodulation  allows  this 
reflection  to  be  separated  fron  other  possible  reflections  of 
the  5  MHz  signal.)  The  phase  difference  between  the  outgoing 
and  inconing  signals  neasures  the  changes  in  the  propagation 
delay  through  the  cable.  Except  for  clock  synchronization 
experinents  (Clark  et  al . ,  1979),  there  is  no  need  to  neasure 
the  absolute  length  of  the  cable. 

Before  conpleting  this  discussion  of  the  phase  calibra¬ 
tion  systen  we  should  discuss  one  possible  problen  with  this 
systen.  The  phase-calibration  systen  injects  pulses  into  the 
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receiver  which  will  be  almost  identical  in  shape  and  spacing 
at  each  site.  There  should  be  a  very  high  correlation 
between  the  phase-calibration  signals  when  the  recordings 
from  each  site  are  cross  correlated.  (The  phase-calibration 
signals  are  not  pulses  on  the  recordings  because  the  signals 
have  been  heterodyned  to  video  frequencies  (see  Section 
2.1).)  The  VLBI  estimation  algorithm  relies  on  the  correla¬ 
tion  between  the  signals  from  the  radio  source.  Will  this 
additional  correlated  signal  affect  the  measurements?  In 
general,  there  should  not  be  any  major  effect  because  of  the 
difference  in  fringe  rate  (i.e.,  the  rate  of  change  of  phase; 
see  also  discussions  on  maximizing  the  delay  resolution 
function,  Appendix  B.3) ,  between  the  phase  calibration  pulses 
and  the  signals  from  the  radio  source.  The  correlation 
between  the  phase-calibration  pulses  will  quickly  dissipate 
as  the  delay  between  the  tapes  is  changed  and  the  rotation 
corrections  are  applied  during  correlation  of  the  tapes.  We 
should  notice  that  observations  with  small  fringe  rates  *?ill 
be  most  affected  by  the  correlation  between  the  phase-cali¬ 
bration  pulses. 

We  may  calculate  the  expected  magnitude  of  the  contribu¬ 
tion  of  the  calibration  pulse's  cross  correlation  by  summing 
the  cross-spectral  components  of  the  calibration  pulses,  i.e. 

we  may  decompose  the  cross  spectra  of  the  recorded  signals 

* 

into  contributions  from  the  signal  (from  the  radio  source) , 
the  noise,  and  the  phase-calibration  signals.  The  cross 
correlation  between  the  calibration  pulses  will  introduce  an 


additional  phasor  to  the  signal  cross-spectral  phasor.  As 
the  cross-spectral  components  are  counter-rotated,  to  account 
for  the  changing  phase  of  the  signal  part  of  the  cross 
spectrum,  this  additional  phasor  (due  to  calibration  pulses) 
will  also  be  rotated.  However,  the  phase  of  the  cross- 
spectral  components  due  to  the  calibration  signals  will  be 
constant  (almost)  before  the  rotation  and,  hence,  after  the 
rotation,  they  will  no  longer  align.  We  may  calculate  the 
expected  amplitude  of  the  summed,  calibration  pulse  cross 
spectral  components,  by  replacing  the  summation  of  these 
contributions  to  the  cross  spectrum,  with  an  integration  (see 
Equation  (A. 15)  for  a  discussion  of  the  coherent  sum  of  the 
cross-spectral  components) .  This  amplitude,  after  T  seconds 
of  integration,  will  be 


Acal  3  I  /  A0expl-ia,rtaptl  dt  1 


(2.3.1) 


where  u>r  is  the  (angular)  RF  of  the  video  channel  before 
heterodyning,  Aq  is  the  amplitude  of  the  calibration  phasor 
(=1%  of  the  total  system  power,  i.e.  AQ  *0.01),  and  rgp  is 
the  a  priori  phase-delay  rate.  The  integration  of  Equation 
(2.3.1)  yields 


,1  -  2Vin  • 


(2.3.2) 


The  integration  of  the  signal  term  over  the  same  interval  of 


time,  assuming  that  is  close  to  the  actual  phase  delay 

rate  (see  Appendix  B  for  details) ,  will  be  (relative  to  the 
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system  power) 

^signal  =  (Ts^s^  T  (2.3.3) 

We  may  now  calculate  the  magnitude  of  the  error  in  the  cross- 
spectral  phases  due  to  cross  correlation  of  the  phase  cali¬ 
bration  pulses.  For  an  observing  frequency  of  2.3  GHz , 

9 

(w  =14.5x10  rad/sec),  x  =100  psec/sec  (small  value  even  for 

L  d  p 

baselines  less  than  1  km),  T=100  sec,  AQ=0.01,  and 
/Ta1Ta2/  (Ts^Ts^  =10” 4 ,  the  ratio  of  the  amplitudes  of  the 
integrated  calibration  phasor  to  the  signal  phasor  would  be 

Acal/Asignal  =0.014.  The  phase  calibration  cross  correlation 

.  .  o 

in  this  case  could  lead  to  phase  errors  of  up  to  0.8  (=0.014 

rad) .  This  corruption  of  the  signal  by  the  phase-calibration 
pulses  is  a  problem  for  observations  on  short  baselines  (<1 
km) .  The  problem  can  be  solved  by  offsetting  the  phase- 
calibration  rails  at  one  site.  However  the  current  Mark  III 
correlator  cannot  process  data  recorded  in  this  manner. 

We  have  now  completed  a  basic  discussion  of  the  VLBI 
observables  and  the  techniques  used  to  estimate  their  values.. 
The  discussion  in  Appendix  A  has  presented  a  variety  of 
methods  of  visualizing  and  studying  the  estimation  of  the 
values  of  the  VLBI  observables.  We  have  applied  some  of 
these  methods  in  Appendix  B  to  study  some  of  the  limitations 
of  the  methods  used  to  estimate  the  values  of  the  observ¬ 
ables.  We  will  now  investigate  other  procedures  which  can  be 
used  to  assess  the  quality  of  the  VLBI  measurements. 


-54- 


3.  The  Quality  of  the  Observations 

In  Chapter  2  we  studied  the  methods  used  to  estimate  the 
group  delay,  phase  delay  and  phase-delay  rate  from  signals 
recorded  at  two  radio  telescopes.  In  this  chapter  we  will 
investigate  the  quality  of  these  measurements.  We  have 
already  seen  that  the  observed  values  of  the  group  delay, 
phase  delay  and  phase  delay  rate  are  related  to  the  radio 
telescope's  position  and  the  radio  source's  position.  In 
Chapter  5  we  will  study  in  more  detail  the  relationship 
between  these  quantities  and  the  observations.  For  the  moment 
the  exact  relationships  are  not  important.  Given  such  a 
relationship,  one  method  of  checking  the  quality  of  the 
observations  would  be  to  compare  them  with  estimates  of  their 
values  calculated  from  such  a  model. 

There  are,  however,  some  major  problems  with  this 
technique  of  studying  observation  quality.  Firstly,  this 
technique  does  not  allow  the  errors  in  the  observations  to  be 
separated  from  the  inadequacies  of  the  models.  Secondly, 
because  the  models  depend  on  parameters  such  as  station  and 
source  positions,  any  observational  errors  will  likely  be  at 
least  partially  absorbed  into  the  estimates  of  these 
parameters.  (Ideally,  we  would  like  to  check  the  observation 
quality  at  the  subcentimeter  level.  In  order  to  achieve  this 
level  of  uncertainty  in  the  model  values  we  would  need  to 
estimate  corrections  to  the  model  parameters,  based  on  the 
observations  themselves.)  If  the  observational  errors  were 
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independent  and  Gaussianly  distributed  then  we  could  easily 
compute  the  expected  reduction  in  the  magnitude  of  the  postfit 
residuals  (i.e.,  the  differences  between  the  observations  and 
the  estimates  of  their  values  from  the  model,  after  parameters 
have  been  estimated  from  the  observations),  due  to  some  of  the 
errors  in  the  observations  being  partially  absorbed  into  the 
estimated  parameters  (see  Appendix  E) .  Unfortunately,  we  are 
interested  in  possible  systematic  errors  in  the  observations 
and  the  statistical  arguments  which  apply  to  "random"  errors 
are  likely  to  be  erroneous  when  applied  to  systematic  errors. 
In  particular,  one  question  we  would  like  to  answer  is:  what 
are  the  fractions  of  the  systematic  errors  which  are  absorbed 
into  the  estimates  of  the  station  and  source  positions?  The 
analysis  of  postfit  residuals  will  never  fully  answer  this 
question . 

How  then  are  we  going  to  study  the  quality  of  the  obser¬ 
vations?  A  VLB  I  observation  is  very  redundant  and  we  will  use 
this  redundancy  to  study  quality.  The  number  of  bits  recorded 

at  each  radio  telescope  during  a  100  second  observation  is 

9 

typically  3.2x10  .  These  bits  are  reduced  to  three  numbers 
which  are  used  for  geodetic  data  analysis  -  the  multiband 
group  delay,  phase-delay  rate  and  phase  delay  (the  visibility 
phase  divided  by  ojq  ;  see  Appendix  A). 

A  review  of  the  methods  used  to  achieve  this  reduction  in 
information  volume  will  reveal  several  methods  which  may  be 
used  to  study  the  quality  of  the  observations.  We  commence  by 
studying  the  behavior  of  the  "residual  phases"  from  each 
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frequency  channel.  The  residual  phases  are  the  differences 
between  the  coherently  averaged  cross-spectral  phases  in  a 
channel  and  the  estimate  of  the  cross-spectral  phase  for  that 
channel,  calculated  from  the  group  delay  and  the  visibility 
phase  (see  discussion  associated  with  Figure  3.1.1).  If  more 
than  two  channels  are  used  in  the  synthesized  band  then  we  may 
use  the  residual  phases  to  study  how  well  the  phases  in  each 
channel  are  modelled  by  the  group  delay  and  the  visibility 
phase.  We  will  study  the  residual  phases  in  Section  3.1. 

After  examining  the  residual  phases,  we  will  examine,  in 
Section  3.2,  the  consistency  of  the  phase  and  cable  calibra¬ 
tions.  These  studies  will  aid  in  interpreting  the  behavior  of 
the  residual  phases.  Finally,  in  Section  3.3,  we  will 
investigate  the  sum  of  delays  around  a  triplet  of  baselines  as 
a  measure  of  quality. 

Many  of  the  studies  of  the  quality  of  the  observations 
were  motivated  by  early  attempts  to  predict  phase-delay  ambi¬ 
guities  from  the  group  delays.  These  prediction  techniques, 
which  probably  provide  one  of  the  most  stringent  quality 
checks  of  the  VLBI  observations,  will  be  discussed  in  detail 
in  Chapter  4.  In  the  following  sections  we  will  attempt  to 
outline  the  properties  of  each  of  the  quality  checks.  Specif¬ 
ically,  we  will  be  most  interested  in  the  limitations  of  each 
of  the  checks,  i.e.  the  nature  of  errors  which  a  specific 
check  can  or  cannot  measure.  For  each  type  of  quality  check 
we  will  give  specific  examples.  We  will  not,  however,  be  able 
to  present  all  of  the  results  thus  far  obtained.  Instead,  we 


will  compile  the  results  of  the  quality  checks  and  make  some 
conclusions  about  the  performance  and  the  limitations  of 
currently  available  Mark  III  systems  (Section  4.4). 


Some  checking  of  the  quality  of  the  observations  is 
carried  out  before  the  data  analysis  stage.  The  program  FRNGE 
checks  a  number  of  error  conditions  during  correlation,  e.g.f 
non-detection  of  the  phase  calibration  signal,  loss  of  time 
synchroni zation  while  reading  the  data  tapes,  and  low  correla¬ 
tion  amplitudes,  are  some  of  the  checks  which  are  made. 
Observations  which  fail  these  checks  are  flagged.  These 
observations  will  be  shown  with  lower  case  letters  when  data 
are  plotted. 

3.1  Residual  phases 

In  Chapter  2  we  investigated  the  methods  used  to  estimate 
the  values  of  the  VLBI  observables  from  the  clipped  signals 
recorded  at  each  site  during  an  observation.  The  data 
recorded  in  each  frequency  channel,  at  each  site,  are  cross 
correlated  in  segments.  These  segments,  referred  to  as 
accumulation  periods,  are  typically  of  2  seconds  duration. 

From  these  complex  cross  correlation  functions,  cross  spectral 
components  are  computed  at  7  discrete  frequencies  in  each  2 
MHz  bandwidth  channel.  In  Appendix  A,  we  show  that  the  esti¬ 
mates  of  group  delay,  phase-delay  rate  and  visibilty  phase  are 
obtained  by  estimating  the  parameters  of  surface,  which  is 
linear  in  both  frequency  and  time,  from  the  cross-spectral 


phases.  (This  estimation  process  is  performed  by  maximizing 
the  delay  resolution  function.  Also,  separate  group  delays 
are  used  to  model  the  phase  variations  across  the  2  MHz 
channels  and  across  the  synthesized  band  (see  Appendix  B  for 
details) ,  the  first  of  these  delays  being  referred  to  as  the 
singleband  group  delay  and  the  second  as  the  multiband  group 
delay . ) 

The  observed  cross  spectral  phase  for  each  accumulation 
period  and  frequency  component  will  not  exactly  match  the 
phase  calculated  from  the  group  delay,  phase  delay  rate  and 
visibility  phase.  We  would  expect  these  differences  to  be  in 
accord  with  the  statistics  of  the  signals  and  the  noise.  It 
is  these  differences  we  wish  to  study. 

In  studying  these  differences  we  are,  however,  faced  with 
a  dilemma.  For  a  four-channel,  dual  sideband  observation  of 
100  seconds  duration,  there  will  be  2800  phase  differences  (50 
accumulation  periods  x  4  channels  x  14  cross  spectral  frequen¬ 
cies)  .  In  a  typical  multi-antenna  intercontinental  VLBI 
experiment  there  will  usually  be  over  1000  such  observations  , 
per  day.  Even  if  the  phase  differences  are  stored  as  single 
precision  numbers,  one  day  of  data  would  take  11.2  Mbytes  of 
storage.  Detailed  analysis  of  this  volume  of  data  is  imprac¬ 
tical  (with  currently  available  computers). 

In  this  section  we  will  study  averages  of  these  phase 
differences.  In  this  way  we  can  reduce  the  volume  of  data  to 
be  analyzed  to  tractable  size.  These  averages  will  be  taken 
over  time  and  frequency  within  each  channel  (both  upper  and 


lower  sidebands) .  For  a  four-channel  dual-sideband  observa¬ 
tion,  the  2800  phase  differences  are  reduced  to  four  phase 
differences,  one  phase  difference  for  each  frequency  channel 
in  the  synthesized  band.  We  will  refer  to  these  averaged 
phase  differences  as  "residual  phases".  These  phases  are 
shown  schematically  in  Figure  3.1.1.  The  averaging  is  carried 
out  in  the  program  "FRNGE"  which  estimates  the  group  delay, 
phase  delay  rate  and  visibility  phase  for  each  observation 
(see  Appendix  B) .  The  residual  phases  are  stored  in  the 
geodetic  data  base  (Ma,  1979)  and  hence  can  be  easily  accessed 
and  analyzed. 

For  the  statistics  of  the  signals  and  the  noise  sources 
given  in  Chapter  2,  we  would  expect  the  residual  phases  to  be 
zero  mean,  Gaussianly  distributed,  random  quantities. 

To  introduce  the  discussion  on  residual  phases  we  show  in 
Figures  3.1.2  and  3.1.3  plots  of  residual  phases  as  a  function 
of  time  from  observations  of  the  radio  source  3C  345  during 
the  June  1981  experiment.  These  plots  show  the  residual 
phases  in  each  of  the  four  frequency  channels  used  in  these 
observations.  The  residual  phase  in  Channel  1  (frequency 
8.34099  GHz)  is  shown  with  a  letter  which  represents  the 
source  being  observed  (these  plots  show  the  residual  phases 
for  a  single  source  3C345,  denoted  by  the  letter  E) .  The 
residual  phases  in  the  remaining  channels  are  denoted  by 
numbers  (i.e.,  the  numerals  2,  3,  and  4  denote  the  8.24099 
GHz,  8.49099  GHz,  8.54099  GHz  channels,  respectively). 

At  approximately  7:00  hr  UT,  typical  error  bars,  based  on 


Figure  3.1.1  Graphical  representation  of  the  residual  phases 
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the  signal-to-noi se  ratio  (SNR)  of  these  observations  are 
shown.  The  error  bars  take  into  account  that  residual  phases 
are  the  differences  between  the  observed  phases  in  each  chan¬ 
nel  and  the  calculated  values  for  these  phases,  after  esti¬ 
mating  the  multiband  group  delay  and  visibility  phase.  The 
uncertainties  of  the  residual  phases  in  the  frequency  channels 
at  the  edges  of  the  synthesized  band  are  approximately  20% 
smaller  than  those  in  the  center  band,  because  errors  (either 
random  or  systematic)  in  these  outer  channels  tend  to  be 
absorbed  into  the  estimate  of  the  group  delay  (see  Appendix  E 
for  a  discussion  on  the  methods  used  to  calculate  these 
uncertainties) . 

Figure  3.1.2  shows  approximately  the  behavior  of  the 
residual  phases  we  would  expect,  i.e.  zero  mean  and  random 
scatter,  although  the  scatter  of  the  values  is  approximately 
1.5  times  larger  than  we  would  expect  given  the  signal-to- 
noise  ratio  (SNR)  of  these  observations  (we  discuss  this  topic 
in  more  detail  later  in  this  section).  Figure  3.1.3  shows  a 
very  different  character.  The  residual  phases  do  not  have 
zero  means  and  there  are  systematic  variations  with  time.  We 
believe  we  understand  the  origin  of  most  of  these  systematic 
variations.  We  will  discuss  this  topic  in  Section  3.2  when  we 
investigate  the  behavior  of  the  phase  calibration  system.  We 
will  concentrate  in  this  section  on  the  mean  values  of  the 
residual  phases  and  their  scatter  about  these  mean  values. 

We  hope  to  answer  several  questions  by  studying  the 
behavior  of  the  residual  phases:  a)  what  is  the  origin  of 
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the  nonlinear  behavior  of  the  visibility  phase  with  frequency? 
b)  what  is  the  long  term  stability  of  the  mean  values  of  the 
residual  phases?  and  c)  what  can  we  deduce  about  the  uncer¬ 
tainties  of  the  group  and  phase  delay  measurements  based  on 
the  scatter  of  the  residual  phases? 

The  non-zero  mean  values  of  the  residual  phases  indicate 
that  the  Mark  III  system  is  dispersive.  We  would  expect  the 
Mark  III  receivers  to  be  dispersive,  but,  we  should  also 
expect  the  phase  calibration  system  to  remove  most  of  this 
dispersion  (we  will  discuss  below  the  meaning  of  "most" 
shortly) ,  because  the  phase  calibration  pulses  travel  with  the 
signals  through  nearly  all  of  the  system.  The  parts  of  the 
receiving  system  that  are  not  calibrated  by  the  phase  cali¬ 
bration  system  are  the  feed  horn  and  the  waveguide  up  to  the 
point  where  the  phase  calibration  pulses  are  injected.  After 
this  injection  point  any  dispersions  in  the  system  should  be 
calibrated  and  removed  by  the  phase  calibration  system. 

So  let  us  start  our  investigations  of  the  origin  of  the 
dispersion  with  a  discussion  of  the  dispersive  properties  of 
feed  horns  and  waveguides.  Propagation  through  waveguides  is 
discussed  in  most  books  on  electromagnetic  radiation.  The 
phase  velocity  of  a  wave  propagating  through  a  hollow  cylin¬ 
drical  waveguide  with  cut-off  frequency  u>x  (i.e.,  the  lowest 
frequency  wave  which  will  still  propagate  through  the  wave¬ 
guide)  is  (Jackson,  1975,  p.  344,  MRS  units) , 

vp  =  (l/Vye)  (l- (wx/w)  2)"1//2 
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where  e  and  are  the  permittivity  and  permeability  of  the 
material  inside  the  waveguide  (in  our  case  this  material  is 
air  and  hence  1/Ven  =  c,  the  free  space  velocity  of  propaga¬ 
tion)  ,  and  w  is  the  frequency  of  the  propagating  wave.  The 
change  in  phase  of  a  signal  propagating  through  a  waveguide 
with  a  free  space  length  of  t  will  be 

A<J>  =  tu)  ( 1- (u>,  /w)  ^  .  (3.1.1) 

The  group  delay,  Atw ,  for  propagation  through  the  waveguide 
may  be  obtained  by  differentiating  Equation  (3.1.1)  with 
respect  to  angular  frequency.  This  differentiation  yields 

Axw  =  tq  (1  -(ux/u)2)~1/2  .  (3.1.2) 

Clearly  the  propagation  through  the  waveguide  is  disper¬ 
sive.  But  is  this  dispersion  sufficiently  large  to  cause  the 
offsets  in  the  residual  phases  seen  in  Figure  3.1.3?  To 
investigate  this  possibility,  the  phase  changes  due  to  the 
propagation  of  a  signal  through  a  10'  long  waveguide  (tq  =  10 
nsec)  with  a  cutoff  frequency  of  5.3  GHz  (cutoff  wavelength  of 
*  5.7  cm)  were  calculated.  (This  cutoff  frequency  is  typical 
for  the  X-band  waveguides  in  the  Mark  III  receivers  (Rogers, 
1983,  private  communication).}  In  Figure  3.1.4  we  show  these 
phases  computed  at  the  frequencies  used  in  the  June,  1981 
experiment.  In  the  lower  part  of  this  figure  we  show  the 
residual  phases  which  would  be  seen  after  the  group  delay  and 
visibility  phase  had  been  estimated.  (We  have  assumed  that 
this  waveguide  precedes  the  injection  point  of  the  phase 
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calibration  pulses.)  We  can  clearly  see  in  Figure  3.1.4  that 
the  waveguide  dispersion  cannot  explain  the  residual  phase 
pattern  (as  a  function  of  frequency)  observed  on  the  Haystack 
to  Owens  Valley  baseline. 

(We  should  note  that  the  phase  delay  through  this  wave- 
guige  computed  from  the  least  squares  fit  to  the  calculated 
phases  is  7.8  nsec  while  the  group  delay  is  12.9  nsec.  There 
is  a  large  offset  between  the  group  and  phase  delay.  This 
offset  will  need  to  be  accounted  for  when  the  ionospheric 
delay  corrections  are  computed  from  group  and  phase  delays  (we 
will  discuss  this  topic  in  detail  in  Section  4.2.2).) 

Calculations  of  feed  horn  dispersions  yield  similar 
results  to  those  shown  in  Figure  3.1.4.  The  change  in  phase 
due  to  the  dispersion  may  be  large  (over  one  cycle  for  the 
waveguide  case) ,  but  the  residual  phases  will  be  small  because 
over  a  limited  spanned  bandwidth,  the  dispersion  is  very  close 
to  linear. 

The  ionospheric  dispersion  (see  Chapter  4  for  details) 

o 

will  produce  residual  phases  at  X  band  of  less  than  1  ,  again,, 

because  over  the  limited  spanned  bandwidth  the  inverse 

frequency  relationship  is  almost  linear.  (For  a  differential 
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electron  content  of  10  electrons/m  (corresponding  to  an 

ionospheric  delay  at  X-band  of  “1.3  nsec)  and  the  X-band 

frequency  sequence  shown  in  Figure  3.1.4,  the  maximum  residual 

o 

phase  would  be  -0.1  in  the  8.34  GHz  channel.  At  S-band,  where 

the  ionospheric  effects  are  much  larger,  the  residual  phases 
o 

could  reach  -0.4  in  the  2.70  GHz  channel  (assuming  the  channel 


sequence  is  2.295  GHz,  2.270  GHz  and  2.220  GHz).  This 
frequency  sequence  was  used  in  our  July  1980  and  June  1981 
VLB I  experiments. 

The  next  possible  origin  for  the  dispersion  which  we  will 
investigate  is  multiple  reflections.  We  will  study  a  simple 
example  of  a  multiple  reflection  that  will  allow  us  to 
characterize  the  general  properties  of  multiple  refections. 

For  a  multiple  reflection  to  exist  there  must  be  at  least  two 
reflecting  boundaries  in  the  propagation  path  of  the  signals 
(the  signal  in  this  case  could  be  either  the  radiation  from 
the  radio  source  or  the  phase  calibration  pulses) .  In  our 
simple  example,  a  multiply  reflected  signal  will  be  generated 
whenever  a  signal  is  partially  reflected  from  one  boundary, 
reversing  its  propagation  direction,  and  then  again  partially 
reflected  from  another  boundary,  again  reversing  its 
propagation  direction  (hence  the  name  "multiple  reflection") . 
In  Figure  3.1.5  we  show  the  geometry  for  multiple  reflections. 
The  characteristics  of  the  multiple  reflection  can  be 
specified  by  three  parameters:  the  distance  between  the 
reflecting  boundaries,  and  the  reflection  characteristics  of 
each  of  the  boundaries. 

The  specification  of  all  of  the  possible  reflecting 
boundaries  in  the  Mark  III  system  is  beyond  the  scope  of  this 
thesis.  We  will,  instead,  try  to  characterize  generally  the 
properties  of  the  multiple  reflections  in  the  Mark  III  system 
based  on  the  available  residual  phase  data.  The  examination 
of  all  of  the  reflecting  boundaries  in  the  Mark  III  system  is 
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actually  not  necessary.  Any  multiple  reflection  which  occurs 
after  the  injection  point  of  the  phase  calibration  pulses  will 
equally  affect  the  phases  of  both  the  signals  from  the  radio 
source  and  the  phase  calibration  signals.  When  the  phase 
calibration  phases  are  subtracted  from  the  video  DC  cross 
spectral  phases  (see  Appendix  B) ,  the  changes  in  cross 
spectral  phases  due  to  the  multiple  reflection  will  be  removed 
provided  that  the  multiple  reflection  does  not  occur  over  such 
a  large  distance  that  phase  "ripples"  occur  over  the  2  MHz 
recorded  bandwidth  (see  further  discussion  later  in  this 
section)  . 

In  the  lower  section  of  Figure  3.1.5  we  plot  as  a 
function  of  frequency  the  effect  of  a  multiple  reflection  on 
the  phase  of  a  signal  which  has  been  multiply  reflected.  The 
amplitude  of  the  phase  error  will  be  a  function  of  the  reflec¬ 
tion  properties  of  the  boundaries.  The  "wavelength"  of  the 
error,  i.e.  the  change  in  frequency  necessary  for  the  error  to 
complete  one  cycle,  will  be  a  function  of  the  distance  between 
the  reflecting  boundaries. 

We  will  derive  the  relationship  between  the  phase  errors 
and  frequency  as  a  function  of  the  distance,  D,  between  the 
reflectors,  and  the  Fresnel  reflection  coefficients  of  the 
boundaries,  and  R2  (see  Figure  3.1.5).  We  will  assume  that 
the  reflection  coefficients  are  sufficiently  small  that  we  can 
use  small  angle  approximations  in  calculating  the  phase 
errors.  (The  magnitude  of  the  dispersion  (<10°)  indicates  that 
these  approximations  should  be  adequate  for  our  calculations.) 


We  will  also  assume  that  the  reflecting  boundaries  are  normal 
to  the  direction  of  propagation  of  the  radiation. 

In  Figure  3.1.5  we  show  the  amplitude  of  the  electric 
field  after  each  interaction  of  the  signal  with  a  boundary. 

The  amplitude  of  the  directly  transmitted  signal  will  be  Et  = 
T1T2Eq  where  and  T2  are  the  two  Fresnel  transmission  coef¬ 
ficient  of  the  reflection  boundaries,  and  EQ  is  the  amplitude 
of  the  electric  field  before  any  reflections.  These  transmis¬ 
sion  coefficients  are  related  to  the  reflection  coefficents  by 
T^l+R^  and  T2  =  l+R2  (See,  for  example,  Kong,  1975,  Chapter 
4).  The  amplitude  of  the  multiply  reflected  signal's  electric 
field  will  be  Er  =  RiR2TlT2Eo  and  *ts  Phase»  relative  to  the 
directly  transmitted  signal,  will  be  $r  =  2Du>/vp,  where  D  is 
the  distance  between  the  reflectors,  and  vp  is  the  phase 
velocity  of  propagation  through  the  medium  between  the 
reflecting  boundaries.  The  phase  error,  A$mr,  at  frequency  w, 
due  to  the  muliple  reflection  will  be  (see  Figure  3.1.5) 

A<|»mr  =  R1R2sin(2Dw/vp)  .  (3.1.3) 

Does  this  expected  error  match  the  observed  residual  phases?  * 
The  widely  spaced  frequencies  used  in  geodetic  experi¬ 
ments  do  not  allow  us  to  answer  this  question  because  the 
phase  error  could  oscillate  many  times  in  the  frequency 

intervals  between  the  channels.  We  can,  however,  calculate  a 

o 

minimum  value  for  the  reflection  coefficients.  For  a  4  phase 
error  to  be  generated  by  multiple  reflections  (this  value  is 
the  maximum  mean  residual  phase  on  the  Haystack  to  Owens 
Valley  baseline  in  June  1981;  see  Figure  3.1.3),  the  product 
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R^R2  must  be  at  least  0.07.  If  we  assume  that  R^=R2r  then  the 
power  reflected  at  each  boundary  is  7%.  However,  mean 
residual  phases  of  up  to  16°  have  been  observed  for  other 
baselines  in  other  experiments  (see  Figure  3.1.6),  which  would 
imply  power  reflections  of  up  to  28%. 

To  study  the  frequency  dependence  of  the  multiple  reflec¬ 
tions  we  need  to  look  at  observations  with  uniformly  spaced 
frequencies  so  that  the  sinusoidal  signature  of  a  multiple 
reflection  can  be  seen.  This  type  of  observation  is  generally 
only  used  for  astronomy  observations  of  weak  sources  (where 
spanned  bandwidth  is  not  important) .  We  do,  however,  have  one 
set  of  observations  on  a  strong  radio  source.  (Strong  sources 
or  large  antennas  are  necessary  to  ensure  that  the  random 
errors  in  the  residual  phases  are  small.)  In  Figure  3.1.6  we 
show  plots  of  the  residual  phases  at  both  X-  and  S-band  from 
the  triplet  of  baselines  involving  Haystack,  NRAO  and 
Bffelsberg.  (These  stations  were  chosen  because  in  all  cases 

the  standard  deviations  of  the  residual  phases  were  less  than 
o 

1  .)  Each  plot  shows  results  from  observations  of  the  radio  -- 
source  4C39.25,  at  two  epochs  separated  by  2  hr  27  min.  This 
radio  source  was  only  observed  twice  in  this  experiment.  The 
primary  observations  in  this  experiment  were  of  1038+528A  and 
B  (Marcaide,  1982)  . 

The  residual  phases  in  Figure  3.1.6  do  not  match  the 
expected  sinusoidal  variation  expected  from  a  single  multiple 
reflection.  This  lack  of  agreement  could  be  due  to  there 
being  more  than  one  multiple  reflection  in  the  system  (e.g.. 


possibly  both  the  signal  from  the  radio  source  (before  injec¬ 
tion  of  the  phase  calibration  pulses)  and  the  phase  calibra¬ 
tion  pulses  (before  being  injected)  are  multiply  reflected. 

If  the  variations  of  the  residual  phases  are  due  to  multiple 
reflections,  we  can  calculate  a  minimum  distance  between  the 
reflecting  surfaces.  For  each  baseline  in  Figure  3.1.6,  the 
residual  phases  seem  to  "cycle"  at  least  once  over  the  28  MHz. 
From  Equation  (3.1.3),  we  see  that  for  A$mr  to  rotate  through 

one  cycle  in  28  MHz,  the  minimum  distance,  D  ,  between  the 

m  in 

reflectors  must  be 

Dn,in  ’  V(2Af)  *  6'25  ” 

where  we  have  taken  Vp=c.  The  phase  velocity  through  the 

waveguide  could  be  as  much  as  28%  greater  than  c  (see 

Equation  3.1.1),  thus  increasing  Dmin- 

The  residual  phases  for  the  two  epochs  agree  well  for  the 

Haystack  to  NRAO  baseline  and  reasonably  well  for  the  X-band 

band  results  involving  baselines  to  Effelsberg.  The  S-band 

results  for  the  baselines  involving  Effelsberg,  however,  show 

o 

differences  of  up  to  4  .  (This  is  approximately  10  times  the 
standard  deviation  of  the  difference  on  the  NRAO  to  Effelsberg 
baseline.)  There  are  no  apparent  errors  in  the  processing  of 
any  of  these  S-band  results  and  we  are  led  to  suspect  that  the 
difference  is  due  to  a  change  in  the  antenna  or  the  receiver 
between  these  two  observations.  The  Effelsberg  antenna  is  100 
m  in  diameter  (it  is  the  largest  fully  steerable  antenna  in 
the  world) ,  and  possibly  we  are  seeing  some  changes  in  the 
geometry  of  the  antenna  (the  observations  were  at  elevations 
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o  o 

of  72  and  50  ) .  There  is  also  a  possibility  that  these 
variations  are  due  to  the  phase  calibration  error  which  will 
be  discussed  in  Section  3.2.  Unfortunately  with  only  two 
observations  it  is  not  possible  to  identify  this  error  (see 
the  method  used  to  detect  phase  calibration  errors  in  Section 
3.2)  . 

Another  possible  cause  of  the  dispersion  is  asymmetry  of 
the  phase  calibration  pulse  shape.  This  problem  is  very 
similar  to  multiple  reflection  of  the  phase  calibration 
pulses,  because  a  multiple  reflection  results  in  a  distortion 
of  the  pulse  shape.  The  effects  of  pulse  shape  on  the  phase 
of  the  calibration  pulses  can  be  studied  using  the  convolution 
theorem  of  Fourier  transforms  (Bracewell,  1978,  p.108) .  The 
phase  calibration  pulses,  pc ( t) ,  can  be  represented  (in  the 
time  domain)  by 

09 

pc(t)  =  /  h(u)  III(t/t  -u)  du  (3.1.4) 

where  h(u)  is  the  pulse  shape  function,  tp  is  the  time 
interval  between  pulses,  and  III(t/t  -u)  is  the  sampling  or 

r 

replicating  function  (Bracewell,  1978  pp .  77-79)  and  repre¬ 
sents  an  infinite  series  of  Dirac  delta  functions  separated  by 


The  Fourier  transform  of  Equation  (3.1.4)  will  be  given 
by  (using  the  convolution  theorem) 

Pc(w)  =H(w)  IIIU/w  )  (3.1.5) 

where  u>p  is  the  frequency  spacing  between  the  comb  elements  of 
the  phase  calibration  rails  (see  Section  2.3)  and  equals 


2n/tp.  From  Equation  (3.1.5),  we  can  see  that  the  phase  of 
Pc(w)  will  depend  on  the  Fourier  transform  of  the  pulse  shape 
We  should  notice  that  if  the  pulse  is  very  sharp,  i.e.  if  the 
duration  of  the  pulse  is  very  small,  then  the  phase  of  H (w) 
cannot  vary  rapidly  with  frequency.  (This  result  can  be 
deduced  from  the  bandlimited  nature  of  the  pulse  in  the  time 
domain.)  From  Figure  3.1.6  we  can  deduce  properties  of  the 
pulse  shape,  if  the  pulse  shape  is  the  cause  of  dispersion. 
Since  we  see  deviations  of  5°  in  the  residual  phases  which 
seem  to  cycle  in  28  MHz,  we  can  deduce  that  *0.1  (sin  5°)  of 
the  power  in  the  pulse  must  occur  at  least  0.035  nsec  (1/28 
MHz)  away  from  the  main  pulse.  Given  that  the  pulse  is 
supposed  to  be  less  than  0.00005  nsec  duration  these  numbers 
seem  very  unlikely. 

These  results  do  suggest  a  possible  cause  for  the 
dispersion.  The  Mark  III  pulse  generator  produces  a  pulse 
every  zero  crossing  of  the  5  MHz  input  signal,  i.e.,  10 
pulses/nsec  are  produced.  Nine  of  the  ten  pulses  should  be 
suppressed,  or  gated,  before  they  enter  the  Mark  III  receiver 
However,  if  the  gating  electronics  are  not  working  properly, 
then  some  of  these  additional  pulses  could  be  entering  the 
receiver  and  corrupting  the  phase  calibration  signal  extrac¬ 
tion.  An  additional  pulse  could  be  0.1  usee  away  from  the 
primary  pulse  which  is  in  accord  with  the  minimum  separation 
which  we  calculated  from  Figure  (3.1.6)  (0.035  usee).  The 
gating  electronics  should  be  carefully  checked. 

We  have  now  studied  several  possible  causes  of  the  dis- 


persion  in  the  Mark  III  system  and  none  of  the  causes  (indi¬ 
vidually)  could  fully  explain  the  observed  ampltitude  verus 
frequency  pattern  of  the  dispersion.  All  of  the  possible 
causes  studied  probably  contribute  to  the  dispersion  to  some 
extent.  An  analysis  of  the  Mark  III  hardware  is  now  being 
commenced  to  ascertain,  if  possible,  the  origin(s)  of  the 
dispersion.  We  now  investigate  the  seriousness  of  the  disper¬ 
sion  on  the  measurement  of  the  visibility  phase  and  the  group 
delay. 

The  dispersion  in  the  Mark  III  system  can  cause  two 
problems.  Firstly,  if  the  relative  weights  of  the  data  in 
each  frequency  channel  change  with  each  observation  then  the 
contribution  of  the  dispersion  to  the  group  delay  will  change 
because  the  group  delay  estimation  is  a  weighted  least  squares 
fit  of  a  slope  to  the  cross  spectral  phases  (see  Appendix  A) . 
Secondly,  if  the  dispersion  is  time  dependent  then  there  will 
also  be  variations  of  the  group  delay. 

The  first  of  these  points  may  be  addressed  by  simply 
calculating  the  weighted  least  squares  fits  to  a  set  of  hypo¬ 
thetical  cross  spectral  phases  using  different  weights  in  each 
channel.  In  Figure  3.1.7  we  show,  as  an  example,  the  results 
of  such  a  series  of  calculations.  We  have  chosen  the  S-band 
frequency  sequence  which  was  used  in  experiments  up  to  June 
24,  1981.  The  frequency  channels  are  located  at  2.295  GHz, 
2.270  GHz,  2.220  GHz.  This  sequence  was  chosen  because  in 
Chapter  4  we  will  be  analyzing  extensively  data  taken  at  these 
frequencies.  In  addition  the  S-band  group  delays  will  be  most 
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sensitive  to  weight  changes  because  of  the  narrow  spanned 
bandwidth  (75  MHz) ,  and  the  use  of  only  three  frequency  chan¬ 
nels.  To  generate  Figure  3.1.7  we  assumed  that  there  was  a  5° 
dispersion  in  the  2.220  GHz  channel  and  that  the  weight  of 
this  channel  relative  to  the  2.295  Ghz  and  2.270  GHz  channels, 
whose  weights  remained  constant,  changed  between  0  and 
infinity.  The  group  delay  errors  are  referred  to  the  case 
when  the  weights  in  all  channels  are  equal  because  this  should 
be  the  typical  value  for  most  observations.  (The  change  in 
weight  of  the  2.220  GHz  could  be  due  either  to  changes  to  the 
amplitude  of  the  signal  or  to  changes  in  the  number  of 
accumulation  periods  used  from  the  channel.)  In  Figure  3.1.7, 
we  see  that  even  for  a  small  dispersion  the  errors  in  the 
group  delay  can  be  as  large  as  0.2  nsec.  However,  we  should 
notice  that  large  errors  (>2  cm)  only  develop  for  very 
disproportionate  weights  among  the  frequency  channels.  (Note 
that  the  multiband  group  delay  does  not  approach  the  single¬ 
band  group  delay  as  the  weight  of  the  2.220  GHz  channel 
approaches  infinity  because  these  two  delay  types  are  treated 
as  independent  parameters  by  FRNGE;  see  Appendix  B.3  for 
details.) 

We  now  address  the  constancy  of  the  dispersion.  Even 
though  the  precise  origin  of  the  dispersion  is  not  known,  we 
can  study  the  constancy  of  the  dispersion  by  accumulating 
their  statistics  from  several  experiments.  There  are  two  data 
sets  which  we  can  use  for  these  studies  -  the  short  MERIT 
campaign  and  the  POLARIS  experiments  (see  Chapter  5  for 
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experiment  details) .  We  study  these  two  sets  of  data  because 
for  long  periods  of  time  (one  month  for  MERIT  and  six  months 
for  POLARIS)  the  Mark  III  systems  were  left  undisturbed  at  the 
radio  telescopes. 

In  Figure  3.1.8,  we  show  one-day  averages  of  the  residual 

phases  from  the  short  MERIT  campaign  for  three  baselines: 

Haystack-OVRO  130,  Haystack-Onsala  60,  and  OVRO  130-Onsala  60 

(see  Figure  5.1  and  Table  5.1  for  station  locations  and 

definitions  for  terminology) .  Each  point  plotted  is  an 

average  of  between  60  and  100  residual  phase  values  and  hence 

the  uncertainty  of  each  mean  value  is  quite  small,  typically 
o 

<0.5.  The  uncertainties  of  the  mean  values  are  plotted  at 

representative  points.  These  uncertainties  were  calculated 

from  the  formal  standard  deviation  of  the  mean,  scaled  by  the 

normalized- root-mean-square  (NRMS)  scatter  of  the  individual 

values  about  the  daily  mean.  (The  NRMS  is  the  square  root  of 
o 

the  x  per  degree  of  freedom  of  the  fit  of  the  observations  to 
the  mean  value?  see  detailed  discussion  later  in  this 
section.) 

One  feature  in  Figure  3.1.8  is  very  noticeable.  On 
October  20,  there  is  an  apparent  change  in  the  dispersion  at 
OVRO  130.  We  should  notice  that  the  error  bars  for  this  day 
are  much  larger  than  those  for  adjacent  days,  although  the 
difference  between  the  mean  residual  phase  for  this  day  and 
for  the  adjacent  days  is  still  many  standard  deviations,  “60. 
The  reason  for  the  increase  in  the  uncertainty  is  related  to 
the  apparent  change  in  the  dispersion.  In  Figure  3.1.9  we 
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Figure  3.1.8  Daily  mean  residual  phases  from  the  short  MERIT  campaign 
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show  the  residual  amplitude  for  the  anomalous  day  and  the 
preceding  day.  (The  residual  amplitudes  are  the  differences 
between  correlation  amplitudes  in  each  channel  and  the  mean 
correlation  amplitude  for  the  observation.)  The  symbols  in 
these  plots  have  the  same  meanings  as  those  in  Figures  3.1.1 
and  3.1.2. 

It  is  clear  from  Figure  3.1.9  that  the  residual  ampli¬ 
tudes  show  much  more  scatter  on  20  October  than  on  19  October. 
(The  scatter  on  21  October  was  the  same  as  that  on  19 
October.)  Also  Channel  4  (8.24  GHz)  has  systematically  lower 
amplitudes  than  do  the  other  frequencies. 

We  mentioned  earlier  in  this  section  that  one  of  the 
major  problems  associated  with  the  dispersion  is  the  apparent 
change  in  the  group  delays,  and  consequently  the  apparent 
change  in  the  dispersion,  when  the  weight  given  to  a  channel 
is  changed.  The  anomalous  behavior  on  20  October  gives  an 
indication  of  how  large  these  effects  may  be. 

If  we  exclude  20  October  from  Figure  3.1.8,  we  see  that 

there  are  still  significant  variations  of  the  dispersion 

between  the  daily  averages.  These  variations  seem  to  be  less 
o 

than  1  per  day  in  most  cases.  What  does  this  indicate  about 
the  constancy  of  th'  dispersion?  To  answer  this  question,  we 
must  remember  that  the  group  delay  and  visibility  phase  have 
been  estimated  from  cross  spectral  phases  and  hence  any  varia¬ 
tions  in  the  dispersion  will  be  partially  absorbed  into  these 
estimates.  Consequently,  the  real  change  in  the  dispersion 
will  appear  smaller  in  the  residual  phases.  (The  phase  delay 
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rate  is  also  estimated  from  the  cross  spectral  phases. 

However,  since  the  group  and  phase  delays  are  referred  to  an 
epoch  near  the  (temporal)  center  of  the  data,  the  phase  delay 
rate  estimate  is  decoupled  from  the  delay  measurements.  We 
may  therefore  study  the  variations  in  the  dispersion  by 
considering  only  estimates  of  the  group  delay  and  visibility 
pha  se . ) 

We  may  calculate  the  expected  reduction  in  the  varia¬ 
tions  of  the  dispersion  due  to  the  estimation  of  the  group 
delay  and  visibility  phase  based  on  the  formulas  given  in 
Appendix  E.  In  Table  3.1.1  we  give  the  numerical  values  of 
matrices  applicable  to  studying  dispersion  variations  (see 
Appendix  E  for  a  discussion  of  the  techniques  used  to  generate 
Table  3.1.1).  We  have  generated  this  table  for  the  frequency 
sequences  used  up  to  June  24,  1981  because  most  of  the  data  we 
will  be  analyzing  were  obtained  at  these  frequencies. 

Of  most  importance  to  us  at  this  time  are  the  numerical 
values  associated  with  Equations  (E.5)  and  (E.6).  Equation 
(E.5)  relates  the  change  in  the  dispersion  (denoted  by  vector 
£r  in  Table  3.1.1)  to  the  changes  in  the  group  delay  and 
visibility  phase.  Equation  (E.6)  relates  the  change  in  the 
dispersion  to  the  change  in  the  residual  phases,  v,  i.e.  to 
the  apparent  change  in  the  dispersion  as  deduced  from  residual 
phases  such  as  those  shown  in  Figure  3.1.8.  (We  should  note 
that  the  matrix  which  relates  £r  to  v  is  singular  and  hence  it 
is  not  possible  to  obtain  unique  values  for  £r  given  v;  see 
Appendix  E  for  further  discussion.) 


Table  3.1.1  Numerical  values  for  studying  the  effects  of 
changes  in  dispersion 


For  X-band 

Frequency  sequence  8.34,  8.49,  8.54,  8.24  GHz. 


From  Equation  (E.2), 

Ar  = 


d>  =  Ax 
-p 


1.0 

1.0 

1.0 


0.00 

0.15 

0.20 


1.0  -0.10 


where  <J>  and  <j>  are  in  cycles,  and  t  is  in  nsec, 
-r  -t  g 


From  Equation  (E.5.a),  x 

r 


'  0.319  0.154 

-1.099  1.538 


0.099  0.429’ 

2.418  -2.857 


cycle/cycle 
nsec  /cycle 


From  Equation  (E.6),  v  =  Ar“£r  =  (I-AA”)£r 


v 


’  0.681  -0.154  -0.099  -0.429' 
-0.154  0.615  -0.462  -0.001 
-0.099  -0.462  0.417  0.143 
-0.429  -0.001  0.143  0.285 


E 

— r 


where  is  the  change  in  the  dispersion  and  v  is  the  chang 
in  the  residual  phases  which  would  be  produced  by  _e 


Table  3.1.1  Continued. 

For  S-band 

Frequency  sequence  2.295,  2.270,  2.220  GHz. 

From  Equation  (E.2)  ,  _$r  *  AXp 

*  1.0  0.000 

1.0  -0.025 

_  1.0  -0.075 

where  and  are  in  cycles,  and  is  in  nsec. 

From  Equation  (E.5.a),  ^ 

A“  =  T  0.714  0.429 

11.428  2.857 

From  Equation  (E.6),  v  =  ^r-£r  *  (I-AA~)£r 

0.286  -0.429  0.143 

0.429  0.642  -0.214 

0.143  -0.214  0.072 

where  £r  is  the  change  in  the  dispersion  and  v  is  the  change 
in  the  residual  phases  which  would  be  produced  by  t_r 


-0.143]  cycle/cycle 
-14.286  nsec  /cycle 


If  we  now  reconsider  the  variations  in  the  mean  residual 
phases  shown  in  Figure  3.1.8,  we  can  make  some  estimates  of 
the  stability  of  the  Mark  III  system.  As  mentioned  earlier  we 
cannot  obtain  unique  estimates  of  the  changes  in  the  disper¬ 
sion  from  the  residual  phases.  One  estimate  of  the  variations 
in  the  dispersion  is  exactly  the  variations  in  the  residual 
phases,  i.e.  one  solution  to  Equation  (E.6)  is  £rssv*  There 
would  be  no  variations  in  the  delay  introduced  by  the  disper¬ 
sion  if  the  £r  were  equal  to  v.  This  solution  is  very 
unlikely  to  be  relevant.  We  may,  however,  use  this  solution 
to  obtain  an  estimate  of  the  minimum  variation  of  the 
dispersion.  The  weighted  root-mean-square  (WRMS)  scatter  of 
the  mean  residual  phases  about  the  mean  values  for  each 
channel  and  baseline,  is  0?7  (excluding  20  October). 

Variations  of  the  dispersion  of  at  least  this  amount  occur  in 
the  Mark  III  system.  We  cannot  place  an  upper  bound  on  the 
variations  of  the  dispersion  (based  on  the  residual  phases) 
because  any  variation  in  the  dispersion  which  mimics  a  delay 
or  a  phase  offset  will  be  absorbed  into  the  group  delay  and 
visibility  phase  estimates  and  hence  not  be  seen  in  the 
residual  phases. 

The  RMS  variation  of  0?7  in  the  dispersion  implies  varia¬ 
tions  in  the  group  delays  of  up  to  8  psec/day  at  X  band  and  36 
psec/day  at  S-band  (these  values  are  obtained  from  the  numeri¬ 
cal  values  of  Equation  (E.5)  given  in  Table  3.1.1).  We  will 
compare  these  estimates  with  other  estimates  of  the  stability 
of  the  Mark  III  system,  obtained  from  predicting  phase  delay 
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ambiguities  from  group  delays,  in  Section  4.3. 

We  may  further  study  the  stability  of  the  Mark  III  system 
on  a  time  scale  of  6  months  by  analyzing  the  mean  residual 
phases  from  the  POLARIS  experiments.  In  Figure  3.1.10  we 
show,  for  a  six-month  interval,  the  weekly  values  of  the  mean 
residual  phases  from  24  hr  duration  observing  sessions  from 
the  eight  channel,  single  sideband  Westford-HRAS  baseline 
observations.  Again,  we  see  that  the  dispersion  is  large,  up 
to  7°  for  some  experiments. 

In  general,  the  dispersion  remains  constant  or  varies 
smoothly  over  the  six  month  period  except  for  one  discontinu¬ 
ity  (in  some  channels)  which  occurred  between  9/30/81  and 
10/15/81.  The  mean  residual  phases  from  these  two  experiments 
are  plotted  as  a  function  of  frequency  in  Figure  3.1.11. 
Clearly,  there  seem  to  be  changes  in  this  dispersion  curve  at 
both  high  and  low  frequencies.  These  changes  would  give  us  an 
indication  of  the  origin  of  the  dispersion,  if  we  could 
isolate  a  specific  change  in  the  configuration  of  the  Mark  III 
system  at  one  of  the  antennas,  because  then  we  could  deduce 
that  this  change  affects  the  dispersion. 

The  station  logs  at  Westford  indicate  that  no  changes 
were  made  between  the  two  experiments  which  show  the  discon¬ 
tinuity  (Webber,  1983,  private  communications).  At  HRAS,  no 
detailed  station  logs  were  being  kept  (at  that  time) ,  and 
hence  we  could  not  establish  whether  any  changes  had  been  made 
to  the  receiver  (Sebring,  1983,  private  communication) . 
(Shortly  after  this  time  a  detailed  station  log  was  started.) 
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For  most  of  this  section  we  have  discussed  the  behavior 
of  the  mean  of  the  residual  phases  from  24  hr  observing 
sessions  in  an  attempt  to  study  the  long  term  behavior  of  the 
dispersion  in  the  Mark  III  system.  We  now  turn  our  attention 
to  the  scatter  of  the  residual  phases  from  each  observation 
about  the  24  hr  mean  values.  These  results  are  compiled  in 
Figure  3.1.12  where  we  have  plotted  the  normalized 
root-mean-square  (NRMS)  scatter  as  a  function  of  the  expected 
scatter  of  the  residual  phases.  (The  NRMS  is  the  square  root 
of  the  x  pec  degree  of  freedom  of  the  residual  phases  about 

2 

their  mean  values.  We  plot  this  quantity,  rather  than  the  x 
per  degree  of  freedom,  in  order  to  decrease  the  dynamic  range 
of  the  plot.) 

We  see  clearly  from  this  figure  that  the  noise  in  the 
residual  phases  does  not  match  the  expected  statistics  of  this 
noise.  The  statistics  of  the  residual  phases  indicate  two 
properties  of  the  performance  of  the  Mark  III  system. 

Firstly,  there  appears  to  be  a  limit  on  the  accuracy  of  the 
Mark  III  system,  i.e.  even  for  observations  of  very  strong 
radio  sources  the  residual  phases  still  show  scatter  which 
presumably  arises  from  instrumental  noise  that  is  independent 
of  the  radio  source  being  observed  (e.g.,  the  effects  of 
variation  in  the  weights  of  each  channel  when  there  is  a  large 
dispersion  present;  variations  in  the  dispersion  (see  earlier 
discussions);  and  "noise"  in  the  phase  calibration  system;  see 
Section  3.2).  Secondly,  the  NRMS  scatter  does  not  approach 
unity  as  the  signal  strengths  decrease.  This  behavior  could 


Normalized  RMS  scatter 


t*VT  ■  i' 


Figure  3 


-93- 

.1.12  Normalized  RMS  scatter  of  the  residual  phases  as  a  function 
of  their  theoretical  scatter 
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be  due  to  instrumental  noise  sources  which  are  a  function  of 
signal  strength,  e.g.  the  correlator  non-reproducibility  seems 
to  behave  in  this  fashion  (see  Appendix  B.4). 

We  have  empirically  fitted  a  model  to  the  behavior  of  the 
NRMS  scatter  which  is  shown  as  the  solid  line  in  Figure 
3.1.12.  This  empirical  model  relates  the  "actual"  standard 
deviation  of  the  residual  phases,  (as  deduced  from  their 
scatter)  ,  to  their  formal  standard  deviations,  <jq,  by 

3t  =  al°l  *  (3.1.6) 

where  ar  and  a are  the  parameters  to  be  estimated.  The  solid 
curve  shown  in  Figure  3.1.12  corresponds  to  or»1.5  and  ac=l°. 
(These  values  were  approximately  calculated  to  match  the 
results  shown  in  Figure  3.1.12.)  These  results  will  form  the 
first  part  of  our  compilation  of  the  performance  of  the  Mark 
III  system. 

In  Figure  3.1.13,  we  show  a  compilation  of  the  statistics 
of  the  residual  phases  similar  to  that  shown  in  Figure  3.1.12, 
for  the  intercontinental  experiment  conducted  in  November  1981 
and  the  transcontinental  experiment  in  June  1981.  In  this 
figure  we  also  show  the  empirical  law  for  the  relationship 
between  the  NRMS  and  the  formal  estimate  of  the  scatter  which 
was  deduced  from  the  MERIT  data.  It  is  clear  that  in  the 
eight-  to  twelve-month  period  between  these  experiments  and 
the  MERIT  experiments,  the  scaling  factor  in  the  empirical 
relationship  seems  to  have  come  closer  to  unity.  This 
improvement  is  probably  related  to  the  improvement  in  the  tape 
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Figure  3.1.13  Normalized  RMS  scatter  of  the  residual  phases  as  a  function  of  their 
I  Theoretical  scatter 


quality  between  these  experiments.  Unfortunately,  there  are 
no  very  small  standard  deviation  observations  in  the  recent 


experiments  which  would  allow  us  to  assess  the  effect  on  <?c. 
(The  formal  uncertainties  of  the  residual  phases  are  larger  in 
November  1981  than  in  June  1981  and  MERIT,  because  the 
November  experiment  used  single  sideband  recording.  We  would 
expect  an  increase  in  the  formal  errors  of  */2  because  less 
data  are  used  to  form  a  channel  with  single  sideband 
recording. ) 

We  will  return  to  these  discussions  of  equipment  perfor¬ 
mance  after  we  have  discussed  the  phase  calibration  system 
performance  and  the  delay  closures  around  triplets  of  base¬ 
lines  . 


3.2  Phase  calibration  system 

In  Section  2.3,  we  discussed  the  operation  of  the  phase 
calibration  system.  We  now  discuss  the  analysis  of  the  phase 
calibration  measurements.  Firstly,  we  consider  the  components 
of  the  Mark  III  system  which  affect  the  phase  of  the  calibra¬ 
tion  signals.  We  will  then  discuss  the  interpretation  of  the 
calibration  phases  from  some  VLBI  experiments.  As  with  the 
residual  phase  behavior,  we  will  defer  detailed  analysis  of 
the  individual  observations  until  Section  4.3,  when  we  discuss 
the  phase  delay  ambiguity  elimination  results. 

In  Figure  3.2.1  we  show  the  calibration  phases  for  a 
six-hour  period,  from  the  8.34  GHz  channel  at  Onsala,  on  July 


Phase  of  the  calibration  signal  in  the  8.34  GHz  channel 
at  Onsala  during  observations  of  3C  345  and  NRA0  512 
on  July  27-28, 1980. 


28,  1980.  (The  lower  case  letters  in  this  and  other  plots  in 
this  thesis,  denote  observations  which  have  been  flagged  as 
defective  by  FRNGE ;  see  beginning  of  Chapter  3.)  These  phases 
drift  between  ±180°  over  the  six-hour  period.  What  do  we  hope 
to  learn  from  these  phases?  To  answer  this  question  we 
firstly  need  to  consider  the  components  of  the  Mark  III  system 
which  affect  these  phases.  In  Section  2.3,  we  discussed  the 
operation  of  the  phase  calibration  system.  A  5  MHz  signal  is 
sent,  via  coaxial  cable,  to  the  receiver  where  it  is  used  to 
generate  the  LO  frequencies  for  the  first  mixers  and  the 
pulses  for  the  phase  calibration  system.  These  phase  calibra¬ 
tion  pulses  are  injected  into  the  signal  train  before  the 
first  mixing  and  amplification  operations.  The  pulses  then 
propagate,  with  the  signals  from  the  radio  source,  through  the 
same  components  of  the  Mark  III  system.  Based  on  this  flow  of 
the  calibration  signals,  we  can  write  an  expression  for  their 
phase: 

♦cal  *  ♦cable  +  ♦lo  +  ♦vc  +  ♦instr  '  (3.2.1) 

where  $cal  is  the  phase  of  the  calibration  signals,  $cat>ie  *s 
the  contribution  to  ♦ca^  from  the  length  of  cable  carrying  the 
5  MHz  signal  (usually  we  only  monitor  changes  in  the  cable 
length;  see  Section  2.3),  $L0  is  the  contribution  of  the  phase 
of  the  first  local  oscillator,  $  that  of  the  oscillator  in 

V  L> 

the  video  converter  for  the  channel,  and  ♦jnstr  that  of  the 
instrumental  delays. 

Of  the  four  contributions  to  ♦ggi*  the  phase  due  to  the 


first  local  oscillator  is  the  most  rapidly  varying.  However, 
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this  term  is  common  to  all  channels  within  one  frequency  band 
(either  X-  or  S-band;  see  discussion  on  heterodyning  in 
Section  2.1),  and  hence  may  be  removed  by  differencing  the 
calibration  phases  from  two  channels.  In  Figure  3.2.2,  we 
show  such  differences  for  the  Onsala  system,  after  removing 
mean  differences.  We  see  now  that  the  rapid  variations  seen 
in  Figure  3.2.1  no  longer  exist,  indicating  that  most  of  the 
variation  in  that  figure  was  due  to  the  phase  changes  of  the 
first  LO. 

We  may  use  these  differenced  phases  to  compute  the 
contribution  of  the  phase  calibration  signals  to  the  group 
delays  by  estimating  the  derivative  with  respect  to  angular 
frequency,  of  the  calibration  phase  differences  (see  discus¬ 
sions  in  Appendix  B  on  the  use  of  the  phase  calibration 
phases) .  These  delays  will  be  referred  to  as  the  "phase 
calibration  group  delays."  We  show  the  contribution  of  the 
phase  calibration  signals  to  the  group  delays  at  Onsala  in 
Figure  3.2.3. 

We  now  examine  in  some  detail  the  phase-calibration 
results  at  other  sites.  In  Figure  3.2.4  we  show  the  contribu¬ 
tion  of  the  calibration  phases  to  the  group  delay  at  Haystack 
during  July  27-28,  1980.  (This  day  was  devoted  mostly  to  the 
3C345/NRA0512  difference  experiment  which  will  be  analyzed  in 
Section  4.3.)  We  should  notice  that  for  the  last  three  hours 
of  this  experiment,  the  phase  calibration  group  delays  show  an 
oscillating  pattern.  The  residual  phases  for  baselines 
involving  Haystack  for  this  same  interval  of  time  also  show  a 


Differences  between  the  calibration  phases  at  8.54  GHz  and  8.24  GHz,  and  the 
8.34  GHz  calibration  phase,  respectively  for  the  observations  in  Figure  3.2.1 
(See  Figure  3.2.1  for  details  of  symbols.) 
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similar  behavior.  When  these  data  were  further  analyzed,  it 
was  found  that  both  the  residual  phase  data  and  the  oscilla¬ 
tion  of  the  phase  calibrated  group  delays,  were  artifacts  of 
the  phase  calibration  system,  i.e.  there  were  errors  in  the 
calibrations  not  in  the  quantities  which  were  being 
calibrated . 

The  major  evidence  for  this  conclusion  is  shown  in  Figure 
3.2.5.  We  plot  for  each  frequency  channel,  the  amplitude  of 
the  phase  calibration  signal  as  a  function  of  its  phase. 

There  are  clearly  sinusoidal  variations  of  the  amplitude  with 
the  signal  phase.  This  type  of  pattern  is  consistent  with  the 
addition  of  a  coherent  signal  to  the  phase  calibration  signal. 
We  show  graphically  the  effects  of  such  an  additional  signal 
in  Figure  3.2.6.  (We  shall  call  this  additional  signal,  the 
"spurious  signal.")  If  we  assume  that  the  spurious  signal  has 
a  constant  amplitude  and  phase  relative  to  the  phase 
calibration  signal  we  can  write  an  expression  for  the  observed 
amplitude  as  a  function  of  the  phase  of  the  calibration 
signal.  From  the  geometry  of  Figure  3.2.6,  we  have 


*obs  ■  *cal  +  +  2AcalVos<»cal-V  ' 


(3.2.2) 


where  A 


obs' 


cal' 


and  A  are  the  observed  calibration  ampli- 
s 


tudes,  the  actual  calibration  amplitude,  and  the  spurious 


signal  amplitude,  respectively,  and  $obs»  $cal,  and  $s  are  the 
observed  calibration  phase,  the  actual  calibration  phase  and 


the  spurious  signal  phase,  respectively.  We  now  wish  to 


develop  an  algorithm  which  will  allow  us  to  calculate  Ag  and 


CKPNNCL  2  1 0*9*1 


'  s  at  different  values  of  $ 


♦g  from  a  set  of  AQbg 


obs-  Once  As 


and  $s  are  known  we  can  then  reconstruct  Acal  and  ♦  ^  from 

the  observed  calibration  signals. 

If  we  assume  that  Ag  <<  Aca^,  then  we  can  linearize 

Equation  (3.2.2),  which  will  make  the  estimation  of  A  and  4> 

s  s 

much  simpler.  The  linearization  of  Equation  (3.2.2)  yields 
Aobs  *  Acal+  AscosUcal-^s)  ' 


which  may  be  rearranged  to  yield, 

A  ,  a  A  -  +  Acco s ( *  , )  +  Assin (*  , )  , 

obs  cal  s  ycal  s  ycal 


(3.2.3) 


where  A^=A  cos*  and  A®=A  sin* .  If  A_  <<  A__, ,  then  e,  the 
error  in  the  calibration  phase,  will  be  small,  and  we  may 
approximately  equate  4>ca^  and  40bg  in  the  cosine  and  sine 
terms  in  Equation  (3.2.3).  This  substitution  yields 


Aobs  •  Acal  *  AScos<W  +  Assin(W 


(3.2.4) 


Also  based  on  the  geometry  of  Figure  3.2.6,  we  may  write  an 
expression  for  the  error  in  the  calibration  phase,  e.  Hence 

e  =  tan“1[ (  Ajsin^obs-A®cos^obs ) /Acal ]  (3.2.5) 

and  the  corrected  calibration  phase  will  be  given  by 

♦cal  =  ♦obs  +  e  *  (3.2.6) 


In  Figure  3.2.5,  we  have  superimposed  the  expected  amplitude 
variations  with  changes  in  the  phase  calibration  signals. 

(The  differences  between  the  predicted  curve  for  the  amplitude 
versus  phase  and  the  observed  values,  for  the  8.34  GHz  chan- 
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nel,  are  due  to  a  variation  of  the  calibration  amplitude  with 
time;  see  discussion  in  Section  2.3.) 

The  Haystack  radio  telescope  was  not  the  only  antenna  to 
be  affected  by  a  spurious  signal  during  the  July  1980  experi¬ 
ment.  The  Onsala  antenna  also  was  affected.  We  show  the 
phase  calibration  signal  amplitude  versus  phase  values  for 
these  observations  in  Figure  3.2.7  Again  we  see  a  very  dis¬ 
tinctive  dependence  of  the  amplitude  on  the  phase,  especially 
in  the  8.34  GHz  and  8.54  GHz  channels. 

We  show  in  the  Figure  3.2.8,  the  phase  calibration  group 
delays  at  Haystack  and  at  Onsala  before  and  after  correcting 
for  the  spurious  signals.  We  notice  that,  at  Onsala,  the 
phase  calibration  contribution  to  the  group  delays  is  almost 
totally  removed  when  the  correction  for  a  spurious  signal  is 
made.  At  Haystack,  however,  there  still  appear  to  be  residual 
oscillations  in  the  later  part  of  the  experiment.  There  are 
several  reasons  for  this  behavior.  Firstly,  the  formulation 
we  presented  for  estimating  the  magnitude  of,  and  correcting 
for,  a  spurious  signal  was  linearized.  The  ratio  of  the 
spurious  signal  strength  to  the  calibration  signal  strength  in 
the  8.54  GHz  channel  at  Haystack  was  0.21.  This  ratio  is 
sufficently  large  for  the  linearization  to  introduce  sizable 
errors.  (The  second  order  terms  will  be  o((0.21)^),  implying 
that  the  correction  could  be  in  error  by  as  much  as  =20%.) 
Secondly,  we  have  assumed  that  the  spurious  signal  is  constant 
in  both  amplitude  and  phase;  this  need  not  be  the  case.  A 
comparison  of  Figures  3. 2. 8. a  and  b  indicates  that  at  Haystack 


Phase  calibration  signal's  amplitude  as  a  function  of  its  phase 
(same  as  Figure  3.2.5  except  for  Onsala) 
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The  effects  of  the  spurious  signal  on  the  phase  calibration  group  delay 
(see  Figure  3.2.1  for  details  of  symbols). 
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the  spurious  signal  may  not  be  constant,  i.e.,  at  times  the 
correction  seems  to  be  too  small  and  at  other  times  too  large, 
indicating  that  the  spurious  signal  may  be  varying  with  time.) 

The  large  magnitude  of  the  spurious  signal  at  Haystack 
seems  to  be  anomalous  (the  effect  of  the  spurious  signal  is 
three  times  larger  at  Haystack  than  at  Onsala) .  Consequently, 
we  have  not  developed  any  software  which  will  handle  the  non¬ 
linear  problems  or  the  possibility  of  time  variations  of  the 
spurious  signal. 

In  Figure  3.2.9,  we  show  the  residual  phases  at  Haystack 
before  and  after  the  correction  for  the  presence  of  the  spuri¬ 
ous  signal.  Again,  we  see  that  the  systematic  trends  in  the 
residual  phases  are  lessened  by  the  correction,  but  not 
totally  removed. 

We  have  so  far  discussed  the  detection  and  correction  of 
the  spurious  signal  effects,  but  we  have  not  discussed  its 
origin.  Unfortunately,  the  exact  origin  of  the  signal  has  not 
been  determined,  although  there  are  many  possibilities.  The 
reason  for  the  many  possible  origins  is  simple:  almost  every 
electronic  component  in  a  radio  telescope  is  controlled  by  the 
5  MHz  signal  generated  by  the  hydrogen  maser,  e.g.,  the  pulse 
generator,  the  local  oscillators  in  the  receiver  and  the  video 
converters,  and  the  time  synchronization  of  the  tape  record¬ 
ers.  Consequently,  there  is  a  high  probability  that  one  or 
more  of  these  pieces  of  equipment  generated  the  spurious 
signal . 

After  the  spurious  signal  was  detected,  guidelines  were 


Figure  3.2.9  The  effects  of  the  spurious  signal  on  the  Haystack  residual  phases 
(see  Figure  3.2.1  for  details  of  symbols). 


Continued. 


set  up  to  check  for  the  presence  of  spurious  signals  in  the 
output  of  the  receiver  and  to  ensure  better  isolation  of  the 
different  pieces  of  equipment  (Rogers,  1981).  The  effective¬ 
ness  of  these  guidelines  will  be  assessed  when  data  taken 
after  the  guidelines  were  issued  can  be  fully  analyzed.  An 
initial  analysis  of  the  two  multibaseline  intercontinental 
experiments  (June  1982  and  December  1982) ,  performed  after  the 
introduction  of  the  quidelines,  has  not  detected  any  spurious 
signals. 

In  this  section,  we  have  tried  to  demonstrate  methods  for 
studying  the  Mark  III  performance  by  analyzing  the  system's 
calibration  data.  The  spurious  signal  problem  indicates  the 
care  that  must  be  taken  when  accurate  observations  are 
attempted  with  a  complex  piece  of  equipment.  We  will  consider 
the  behavior  of  the  phase  calibration  system  again  in  Section 
4.3  when  we  are  analyzing  the  prediction  of  phase  delay 
ambiguities  from  group  delays. 


3.3  Group  delay  closures 

We  now  discuss  the  final  quality  check  which  we  will 
investigate  in  this  chapter  --  the  properties  of  the  sum  of 
the  delays  around  triplets  of  baselines.  These  sums  will  be 
referred  to  as  delay  closures.  In  this  section  we  will  study 
the  closures  of  the  group  delays  because  these  are  the  delays 
which  are  used  in  geodetic  data  processing.  As  with  earlier 
sections  in  this  chapter,  we  will  study  some  specific  examples 
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of  this  quality  check,  but  we  will  not  be  able  to  present  all 

of  the  results  which  have  been  obtained  thus  far. 

Delay  closures  around  triplets  of  baselines  have  been 

used  as  a  quality  check  since  the  first  multibaseline  Mark  I 

VLB I  experiments  (Whitney,  1974,  pp.  276-279).  The  delay 

misclose,  At  ,  around  a  triplet  of  baselines  is  given  by 
m 


(ibid,  p.  277) 


At 


m 


*12  +  *23~  *13  +  *12*23 


(3.3.1) 


where  ?12,  t13,  t23  are  the  observed  delays  (either  phase  or 
group)  for  the  three  two-element  interferometers  formed  by  the 

A 

pairs  of  tne  three  sites,  and  t23  is  the  observed  phase  delay 
rate  for  the  interferometer  formed  by  Sites  2  and  3.  The 
final  term  in  Equation  (3.3.1)  accounts  for  the  epoch  of  the 
delay  measurement  being  referred  to  the  time  of  arrival  of  a 
signal  at  the  "first"  site  in  an  interferometer  (see  Whitney, 
1974,  p.  276,  and  Appendix  B) .  The  method  we  have  adopted  to 
evaluate  Equation  (3.3.1)  is  not  direct  computation.  We  have 
instead  used  the  property  of  the  theoretical  delays  computed 
by  VLB I  analysis  software;  these  delays  sum  to  zero  around  a 
triplet  of  baselines  (when  they  are  referred  to  the  same 
epoch)  ,  i  .e  . 


t  ,t  _ 
t12*23= 


0 


(3.3.2) 


where  the  superscript  t  refers  to  the  theoretically  calculated 
delays.  If  we  now  subtract  Equation  (3.3.2)  from  Equation 
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(3.3.1),  we  obtain 


Ax 


m 


+ 


+  (x23“‘c23) 
(*23_*23)t12* 


(‘e13_T13)  +  (t12"t12 


If  we  now  introduce  the  postfit  residuals,  v^ j= (x^ j-Xj j)  ,  then 
the  above  equation  reduces  to 


Ax 


m 


v12  +  v23 


-  V 


13+  V12*23  +  *23T12 


(3.3.3) 


The  final  two  terms  in  the  above  equation  are  very  small  and 
can  be  neglected  in  the  calculation  of  the  closures.  (For 
Vi2sl*0  nsec,  ^23=**°  psec/sec,  the  maximum  contribution  to 
the  closure  calculation  of  the  final  two  terms  will  be  0.002 
psec  and  0.020  psec  when  t23*«2.0xl0“6  sec/sec  and  x^2“0.02 
sec.  The  latter  two  values  are  upper  bounds  for  Earth  based 
VLB I . ) 

When  the  final  two  terms  of  Equation  (3.3.3)  are  neglect¬ 
ed,  the  calculation  of  the  delay  miscloses  reduces  to  simply 
summing  the  postfit  residuals  around  baseline  triplets. 

In  Figure  3.3.1,  we  show  the  delay  closures  at  both  X- 
and  S-band,  for  the  radio  source  3C  345,  on  two  triplets  of 
baselines.  The  error  bars  shown  are  calculated  from  the  sum 
of  the  variances  of  delays  from  each  baseline  in  the  triplet 
and  are  shown  at  typical  points.  We  show  in  Appendix  C  that 
the  variance  of  the  misclose  calculated  in  this  manner  should 
be  an  overestimate,  due  to  correlations  between  the  observa¬ 
tions  on  each  baseline,  by  less  than  1%. 


The  X-band  miscloses  shown  in  Figure  3.3.1  do  not  show 


any  particular  trends  although  their  scatter  is  1.3  tiroes 
greater  than  we  would  expect  from  the  standard  deviations. 

The  S-band  results,  however,  do  show  systematic  trends.  These 
miscloses  could  be  due  to  the  brightness  distribution  of  the 
radio  source  (see,  e.g.,  Cotton,  1979;  Shapiro  et  al.,  1979). 
However,  currently  available  brightness  distributions  for  this 
source  (Breidenthal ,  1982)  do  not  remove  these  trends.  The 
origin  of  these  miscloses  is  still  being  investigated  (see 
below).  In  Section  4.3,  we  will  investigate  the  complications 
these  miscloses  will  cause  when  we  attempt  to  predict  phase 
delay  ambiguities  from  group  delays. 

To  determine  a  possible  origin  for  the  S-band  group  delay 
closure  errors  we  need  to  look  at  the  Mark  III  hardware.  In 
Appendix  F  we  develop  a  model  for  the  effects  of  "leakage”  of 
lef t-circularly  polarized  radiation  into  the  output  of  a  feed 
horn  which  is  nominally  meant  to  receive  r ight-circularly 
polarized  radiation.  (This  model  was  originally  suggested  by 
Dr.  A.E.E.  Rogers  at  the  Haystack  Observatory.)  The  contri¬ 
bution  of  the  lef t-circularly  polarized  radiation  to  the  group 
delays,  At^,  is  (for  an  unpolarized  radio  source;  see  Equation 
F.5)  , 

Ata  =  At^  sin(2A<!>  + 

where  At^  is  a  nominally  constant  delay  which  is  function  of 
(i)  the  ratio  of  the  power  from  the  lef t-circularly  polarized 
to  the  power  from  right-circularly  polarized  radiation  at  the 


18- 


Figure  3.3.1  Group  delay  closures  for  observations  of  radio  source 
3C  345  on  July  27-28,1980. 


X-band 

Baselines  HA-HR  -*■  HR-EF  EF-HA  Baselines  HA-EF  EF-ON  -*•  ON-HA 


Group  delay  closure  error  (nsec) 


S-  BAND  DATA 


Baselines  HA-HR  +  HR-EF  ■+  EF-HA 


Group  delay  closure  error  (nsec) 


Baselines  HA-EF  ♦  EF-ON  ON-HA 


Group  delay  closure  error  (nsec) 


Group  delay  closure  error  (nsec) 


Error  bars  are  shown  only  at  typical  points. 

Symbols  HA  Haystack,  EF  Effelsberg,  HR  HRAS  085,  ON  Onsala  (See  Table  5.1) 
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output  of  the  feed  horn,  and  (ii)  the  spanned  bandwidth  of  the 
group  delay  measurement  (see  Appendix  F  for  details);  A<t»  is 
the  difference  of  the  feed  horn  position  angles  at  each  site 
in  the  interferometer  (see  Appendix  D  for  details) ;  and 

is  the  phase  difference  between  the  left-  and 
o 

r ight-ci rcular ly  polarized  radiation  after  these  signals  have 
propagated  through  the  feed  horns  at  the  two  sites  in  the 
interferometer . 

In  Figure  3.3.2  we  show  an  approximate  matching  of  the 
polarization  leakage  model  to  the  group  delay  closure  errors 
for  the  Haystack-Ef felsberg-Onsala  triplet  from  the  27-28  July 
1980  experiment,  assuming  At^  =  0.5  nsec  (approximately 
estimated)  for  the  Haystack-Onsala  interferometer.  The 
polarization  leakage  model  could  explain  many  of  the  apparent 
errors  in  the  group  delays  and  now  needs  to  be  thoroughly 
investigated . 


Universal  time  (days) 


Figure  3.3 


.2  The  effects  of  feed  horn  imperfections  on  the  Haystack- 
Ef felsberg-Onsala  group  delay  closure  errors  for  the 
date  given  in  Figure  3<3«1* 


Feed  horn  imperfection  effect  computed  using 

AtP  =0.5  nsec  and  =  90° 
o  o 

(see  text  and  Appendix  F  for  discussion) 
Error  bars  are  shown  only  at  typical  points. 


4.  ELIMINATION  OF  PHASE  DELAY  AMBIGUITIES 


In  Chapter  3  we  investigated  the  quality  of  the  Mark  III 
VLB  I  estimates  of  the  group  delay,  phase  delay  and  phase 
delay  rate  using  the  redundant  nature  of  the  VLBI  observ¬ 
ation.  In  this  chapter  we  will  continue  to  check  observation 
quality  by  attempting  to  combine  the  group  and  phase  delay 
measurements  to  produce  a  delay  measure  which  has  the  precis¬ 
ion  of  the  phase  delays  and  is  calibrated  for  the  plasma 
delay  through  the  ionosphere.  In  Chapter  2  we  discussed  the 
methods  used  to  estimate  both  the  group  and  phase  delays  from 
the  recordings  of  signals  from  a  radio  source.  The  phase 
delay  was  however  ambiguous  by  multiples  of  2it/w  seconds. 

(The  multiband  group  delay  is  also  ambiguous,  but  the 
ambiguity  spacing  is  much  larger  for  this  delay  measure  than 
for  the  phase  delays.  The  group-delay  ambiguities  can 
normally  be  resolved  using  a  priori  information.)  In  this 
chapter  we  will  investigate  methods  which  could  be  used  to 
remove  the  phase-delay  ambiguities. 

The  sensitivity  of  the  Mark  III  VLBI  system  allows  group 
delay  measurements  to  be  made  with  uncertainties,  due  to 
random  noise,  sometimes  as  low  as  a  few  picoseconds  and  quite 
often  lower  than  30  psec.  This  sensitivity  is  achieved  by 
using  wideband  receivers  (400  MHz  at  X-band,  =8.4  GHz,  and 
100  MHz  at  S-band,  =2.2  GHz)  and  multi-track  2  MHz-bandwidth- 
per-track  tape  recorders.  With  group-delay  uncertainties  of 


less  than  30  psec  at  X-band  and  100  psec  at  S-band  it  should 
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be  possible  to  determine  estimates  of  the  phase  delays  at 
X-band  and  S-band  which  are  sufficiently  accurate  that  we  can 
resolve  the  phase-delay  ambiguities  of  *120  psec  at  X-band 
and  *440  psec  at  S-band.  However,  the  signals  which  have 
been  recorded  at  each  site  have  propagated  through  a  disper¬ 
sive  medium,  mainly  the  Earth's  ionosphere,  and  therefore  the 
group  and  phase  delays  will  not  be  equal.  In  order  to  be 
able  to  predict  the  phase  delays  we  will  have  to  calibrate 
the  dispersive  delay.  In  Section  4.1  we  will  investigate  the 
models  which  are  used  to  calibrate  the  dispersive  delay.  We 
will  consider  both  the  approximations  made  in  using  the 
models  and  the  methods  which  should  be  used  to  apply  these 
models  to  wideband  VLBI  observations. 

The  prediction  of  phase-delay  ambiguities,  based  solely 
on  group  delay  observations,  is  an  experimental  technique. 

We  therefore  require  a  method  to  check  its  validity.  With 
this  in  mind.  Section  4.2  will  concentrate  on  developing  the 
prediction  algorithms  using  observations  which  can  be  "phase 
connected"  without  the  use  of  the  group  delays.  These  data 
which  usually  are  observations  of  a  pair  of  radio  sources, 
closely  spaced  in  the  sky,  may  be  "phase  connected"  because 
of  the  short  duration  of  time  (<8  mins)  between  observations 
of  each  of  the  sources.  Over  this  short  interval  of  time,  we 
can  model  the  behavior  of  the  phase  to  within  much  less  than 
a  cycle  allowing  sequential  observations  to  be  "phase 
connected"  to  each  other,  provided  there  are  no  large  data 
gaps.  This  technique  has  been  used  successfully  in  the  past 
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to  obtain  phase  delays  (Wittels,  1975;  Shapiro  et  al . r  1979). 
In  Section  4.3,  we  will  attempt  to  apply  the  techniques 
developed  in  Section  4.2  to  a  standard  geodetic  schedule. 

4.1  Ionospheric  propagation  delay  models 

In  this  section  we  will  derive  the  frequency  dependence 
of  the  delay  caused  by  the  propagation  of  a  signal  throuqh 
the  ionosphere.  While  deriving  the  frequency  dependence  we 
will  neglect  several  effects  which  describe  the  interaction 
between  the  ionosphere  and  the  propagating  electromagnetic 
waves.  In  Section  4.1.2  we  will  investigate  the  possible 
magnitude  of  the  errors  in  the  dual  frequency  calibration 
system  due  to  the  neglect  of  these  additional  terms. 

We  model  the  ionosphere  as  a  plasma  of  electrons  with  a 

3 

density  of  Nv  electrons/m  which  are  in  motion  around  fixed 
positive  ions.  The  macroscopical ly  averaged  (i.e.f  averaged 
over  a  volume  large  compared  to  the  volume  of  an  atom) 
electric  dipole,  (see,  e.g.,  Jackson,  1975,  Section  I  and 
Chapter  4.1),  for  a  plasma  is: 

=  -  Nvex  ,  (4.1.1) 

-19 

where  e  is  the  charge  of  an  electron  (1.6x10  Coul)  and  x 
is  the  average  (vector)  displacement  of  the  electrons  from 
their  fixed  ions.  In  the  presence  of  an  electric  field  , 
the  equation  of  motion  of  a  free  electron  in  the  ionosphere 


*/  ’->«  i 


>  f.a 


is  given  by  (Jackson  1975, p.  285  and  p.  292): 


n»ex  -  (e/c)  Bqxx  -  meyex  =  -eEt  (4.1.2) 

where  x  and  x  are  the  average  velocity  and  the  average 
acceleration  of  the  electron,  me  is  the  mass  of  the  electron 
(9.11xl0-31  kg),  c  is  the  velocity  of  light,  §Q  is  the  mag¬ 
netic  induction  of  the  Earth's  magnetic  field  and  y  is  a 
damping  constant  (we  will  discuss  the  meaning  of  the  damping 
constant  in  Section  4.1.2.).  If  we  neglect  the  effects  of 
the  magnetic  field  terms  and  damping  terms  in  Equation 

(4.1.2),  the  equation  of  motion  reduces  to 


m  x  =  -eE .  . 

e  t 


Incorportion  of  Equation  (4.1.1)  into  the  above  equation 
yields  a  relationship  between  the  electric  dipole  and  the 
electric  field  given  by 
m 

(4.1.3) 


N-l  ^t  =  eh  • 


We  will  now  assume  that  Efc  is  a  time-harmonic  field,  i.e. 
Et(x,t)  =  E(x)e~lwt,  where  u  is  angular  frequency.  For  this 
choice  of  field  type.  Equation  (4.1.3)  can  be  solved  yielding 
2 

(4.1.4) 


N  e 

P  - - E 


m  w 
e 


where  P  is  the  positional  part  of  the  time-harmonic  electric 
dipole:  Pfc(x,t)  =  P(x)e-1U)t. 

Equation  (4.1.4)  can  now  be  used  to  determine  the  per¬ 
mittivity  tensor,  t,  in  region  x.  The  permittivity  tensor 
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relates  the  electric  field  E  to  the  electric  displacement 
field  D  by  5  =  eE.  The  electric  displacement  field  is  also 
given  by  5  =  eQE  +  P  (Jackson,  1975,  p.  14),  where  eQ  is  the 
permittivity  of  free  space.  The  properties  of  the  iono¬ 
sphere  when  modeled  by  Equation  (4.1.3)  are  isotropic  and  the 
permittivity  tensor  reduces  to  e  =  cf  where  T  is  a  unit 
tensor  and  e  is  the  dielectric  constant  of  the  ionosphere. 

The  combination  of  the  two  expressions  for  the  electric 
displacement  field  yields  the  dielectric  constant  for  the 
ionosphere  (remember  the  expression  for  this  dielectric 
constant  is  only  approximate  because  of  the  terms  in  Equation 
(4.1.1)  which  were  neglected): 


e  =  e0(!  - 


Nve 


) 


(4.1.5) 


meEo“ 


We  may  now  use  the  dielectric  constant  to  estimate  the 
phase  velocity  of  electromagnetic  waves  propagating  through 
the  ionosphere.  For  plane  waves,  the  electric  field  is  given 


by  EQexp [ iR »x-i wt] ,  where  R  is  the  wave  vector.  For  a  wave 


to  propagate  through  a  medium,  the  dispersion  relationship 
between  k  and  u>  must  be  satisfied.  For  plane  waves  propa¬ 
gating  through  an  isotropic  medium  the  dispersion  relation¬ 
ship  between  R  and  on  is  (ibid  p.  271): 


2 

(i)  H0e 


where  no  is  the  permeability  of  free  space.  The  phase  veloc¬ 
ity  of  the  wave  is  given  by  V  =  <,>/ 1 R  |  .  The  combination  of 

r 

Equation  (4.1.5),  the  dispersion  relationship  and  the 


definition  of  phase  velocity  yields 


vp  -  C 


T  1  /2 


1  - 


Nve 


mew 
e  o 


(4.1.6) 


where  c  is  the  free  space  velocity  of  propagation  (cz  = 
l/^o eQ) •  We  can  write  the  expression  for  the  phase  refrac¬ 


tive  index  of  the  ionosphere,  np,  directly  from  Equation 


(4.1.6) 


np  = 


1  - 


N  e 
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m  e  u 
e  o 
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(4.1.7) 
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where  we  have  substituted  Up  *  Nye  /meeQ  in  the  second  equal¬ 


ity  in  Equation  (4.1.7);  Up  is  the  plasma  frequency. 


Equation  (4.1.7)  gives  the  expression  for  the  phase 
refractive  index  of  the  ionosphere.  Before  evaluating  the 
expression  for  the  propagation  delay  through  the  ionsphere  we 
will  make  one  further  approximation  in  the  expression  for  the 
phase  refractive  index;  we  will  expand  the  square  root  in 
Equation  (4.1.7)  to  first  order,  i.e.  we  will  approximate  np 
by 

(4.1.8) 


i  2/0  2 

np  =  1  _  U)p/2U 


(The  effects  of  the  neglect  of  the  second  order  term  in  the 
expansion  will  be  studied  in  Section  4.1.2.) 

The  difference  in  the  propagation  delay  between  the 
free-space  propagation  and  the  propagation  through  the  iono¬ 
sphere  will  be  given  by 


Ip  *  /  (n  (s)/c)  ds  -  /  (1/c)  ds 


(4.1.9) 


where  the  integration  path  is  shown  in  Figure  4.1.1.  (The 


variation  of  Op  with  height,  shown  in  this  figure,  is  only 

meant  to  illustrate  the  general  behavior  of  n  .  The  values 

P 

of  np  along  the  ray  path  should  be  used  to  evaluate  the 
integrals  in  Equation  (4.1.9).)  We  will  refer  to  1^  as  the 
"ionospheric  phase  delay".  (Strictly  the  line  integral  given 
above  is  only  valid  if  the  the  wavelength  of  the  variations 
of  Op(s)  are  large  compared  to  the  Fresnel-zone  diameter  at 
the  height  of  the  ionosphere.  We  will  discuss  the  Fresnel- 
zone  diameter  in  detail  in  Section  4.1.2.) 

Substitution  of  Equation  (4.1.8)  into  the  above  integral 
yields 

2 

IP  =  — - 2  /  Nvds 

2cm  Eo)  0 
e  o 

We  will  define  the  integrated  electron  content  to  be 
sQ  — 

N.  =  |  N  ds 

1  o  v 

which  yields  the  expression  for  the  ionospheric  phase  delay 


2 

e  N  . 

s _ L 


2cm  t  oj 
e  o 


(4.1.10) 


The  variations  of  Ny  with  altitude,  location,  time  of 
day  and  time  of  year  are  too  erratic  for  to  be  accurately 
determined  using  (non- 1 ine-of-sight)  measurements  or  predic¬ 
tions  of  Nv.  (Measurements  of  the  integrated  electron  con¬ 
tent  could  be  used,  if  these  measurements  were  made  at  the 
radio  telescopes  and  in  the  direction  of  the  radio  source.) 


However,  Equation  (4.1.10)  suggests  a  method  we  can  use  to 
avoid  directly  determining  N ^ .  Measurements  of  the  propaga¬ 
tion  delay  through  the  ionosphere  at  two  widely  spaced 
frequencies  can  be  used  to  infer  N^.  This  technique  is  used 
with  dual  frequency  Mark  III  receivers  to  calibrate  the 
differential  path  delay  through  the  ionosphere. 

We  may  derive  the  "ionospheric  group  delay"  from 
Equation  (4.1.10).  If  we  multiply  Equation  (4.1.10)  by  w  we 
obtain  the  change  in  phase,  4*1,  due  to  the  propagation 
through  the  ionosphere: 
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Differentiation 
frequency  will  y 
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o 

of  Equation  (4.1.11)  with  respect 
ield  the  ionospheric  group  delay, I 


to 

g 


(4.1.11) 


angular 


I9  =  +  - 1-~2  (4.1.12) 

2cm  e  w 
e  o 

Strictly  we  should  include  a  term  involving  SN^/du)  which 
accounts  for  the  difference  in  the  integration  path  at  dif¬ 
ferent  frequencies.  We  will  investigate  this  path  dependence 
in  Section  4.1.2. 

Equations  (4.1.10)  and  (4.1.12)  give  the  frequency 
dependence  of  the  ionospheric  phase  and  group  delays.  In  the 
following  section  we  will  investigate  the  method  used  to 
calibrate  the  ionospheric  delays  using  dual  frequency  mea¬ 
surements.  In  Section  4.1.2.  we  investigate  the  magnitude  of 


the  errors  in  the  dual  frequency  calibration  due  to  the 


VLB I  observations 


In  Section  4 . 1  we  derived  the  frequency  dependence  of 
both  the  group  and  phase  delays  due  to  propagation  of  a 
signal  through  the  ionosphere.  In  this  section  we  will 
investigate  the  methods  used  to  calibrate  the  ionospheric 
delay  by  exploiting  its  frequency  dependence.  The  algorithm 
to  be  used  is  not  as  trivial  as  it  first  appears.  The  iono¬ 
spheric  delay  at  X-band  is  usually  less  than  1  nsec.  It  can 
be  easily  calculated  that  if  the  ionospheric  delay  is  1.0 
nsec  at  8.0  GHz,  then  at  8.4  GHz  the  ionospheric  delay  will 
be  0.9  nsec.  Hence,  there  is  variation  of  =<0.1  nsec  (  =  3  cm) 
in  the  ionospheric  delay  over  the  frequency  range  spanned  by 
the  synthesized  band.  In  this  section  we  will  investigate  a 
method  for  compensating  for  the  variation  in  the  ionospheric 
delay  over  the  synthesized  band.  We  will  use  the  equivalence 
of  maximizing  the  delay  resolution  function  and  least  squares 
to  determine  the  relationships  between  the  estimates  of  the 
group  delay,  phase  delay  rate  and  visibility  phase,  and  the 
integrated  electron  content  and  its  rate  of  change.  We  will 
then  determine  an  algorithm  for  calculating  the  effective 
frequencies  for  the  group  delay,  phase  delay  and  visibility 
phase  measurements. 

VLBI  observations  are  difference  measurements,  i.e.  the 


group  delay  is  the  difference  in  the  arrival  times  of  signals 
from  a  radio  source  at  two  antennas.  Consequently,  VLBI 
observations  cannot  determine  the  total  integrated  electron 
contents  along  the  ray  paths  from  the  radio  source  to  each 
telescope.  Only  the  difference  in  the  integrated  electron 
content  can  be  determined.  Hence,  the  contribution  to  the 
estimated  cross  spectral  phases  from  the  propagation  of  the 
signal  through  the  ionosphere  will  be,  from  Equation 
(4.1.11) , 

i  e2i[)i 
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where  AN^  is  the  difference  in  the  integrated  electron 
contents  along  the  propagation  paths  from  the  radio  source  to 
the  two  sites  in  the  sense  AN.  =  (N^2  -  (N.)^,  where  the 
subscripts  1  and  2  denote  the  two  sites.  For  the  duration  of 
an  observation  (between  100  sec  and  400  sec),  we  may  model 
AN^  by 

AN. (t)  =  AN?  +  AN. (t-t  ) 
l  110 

where  AN°  is  the  integrated  electron  content  at  the  epoch, 
tQ,  of  the  observation,  and  AN^  is  the  rate  of  change  of  the 
integrated  electron  content  at  that  epoch. 

The  effect  of  the  ionospheric  delay  on  the  cross  spec¬ 
tral  phases  from  an  accumulation  period  (see  Section  2.2)  as 
a  function  of  time  and  frequency  will  be 

A*^  =  -  K  ( AN?  +  AN.  (t  -tQ)  ] /ok  (4. 1.1.1) 

where  q  is  an  index  denoting  the  epoch  of  the  accumulation 


2 

period,  j  is  an  index  denoting  frequency,  and  K  =  e  /(2cmeeo) 
=  5.2xl0~6  m2s-1 . 

In  Appendix  A,  we  see  that  the  model  used  to  relate  the 
cross  spectral  phases  to  the  estimates  of  the  group  delay, 
the  phase  delay  rate  and  the  visibility  phase  is  (Equation 
A. 3)  : 


♦«  “  <W-Mo>Tg  +  UtP(t-tO)  +  *t 

In  the  discrete  form  of  Equation  (4. 1.1.1)  the  above 
equation  becomes 


♦qj  =  ( u) j ~0)o )  +  wjxp(tq-t0)  +  (4. 1.1. 2 

We  now  will  investigate  the  relationship  between  the 
estimates  of  xq,  f  and  4>t  obtained  from  the  model  given  by 
Equation  (4. 1.1. 2)  and  the  integrated  electron  content,  An?, 
and  the  time  derivative  of  the  integrated  electron  content. 


From  a  single  VLBI  observation  the  cross-spectral  phases 
will  be  estimated  at  Q  epochs  and  at  J  frequencies.  For  a 
typical  observation,  Q  will  of  the  order  of  50  (for  a  100  sec 
observation  with  2  sec  accumulation  periods)  and  J  will  be 
approximately  four.  For  these  QJ  estimates  of  the  cross 
spectral  phases,  Equation  (4. 1.1.2)  may  be  written  as  a 
matrix  equation  of  the  form 

1  =  A  £  (4. 1.1. 3) 

T 

where  £  =  (4>t,  Xg  r  xp)  and  the  QJ  rows  of  A  will  be  given  by 

(If  u>j-to0f  u)j(tq-tQ)],  j  =  l+j,  q*l+Q.  The  weighted  least 


squares  estimates  of  £  given  £  will  be 


_  T  -1  -1  T  -1 

£  =  (A  V  A)  A  V  £ 


(4. 1.1. 4) 


T 

where  V  is  the  covariance  matrix  of  £  i.e.  V  =  <£  £  >  where  £ 
is  the  vector  of  noise  contributions  to  £.  (For  those 
readers  unfamiliar  with  weighted  least  squares,  this 
technique  is  reviewed  in  Appendix  E.) 

We  will  assume  that  all  of  the  cross  spectral  phase 
estimates  have  the  same  variance  and  that  the  covariances 
between  the  estimates  are  zero.  With  these  assumptions  the 
covariance  matrix  cancels  in  Equation  (4. 1.1. 4)  and  the 
equation  reduces  to 


£  =  (ATA)  1AT  £  (4. 1.1. 5) 

We  may  evaluate  Equation  (4. 1.1. 5)  analytically  when  the 

observed  phases  are  given  by  Equation  (4. 1.1.1).  (The  linear 

properties  of  the  estimator,  Equation  4. 1.1.5,  allow  us  to 

study  the  ionospheric  delay  independently  of  the  other  delays 

which  may  be  present  in  the  observations.)  We  will  first 
T 

evaluate  A  A  which  will  is  by 


T 

A  A  = 


Q.T 


J 

J 

0 

°jI1(wj“wo) 

l 

j  =  l 

J! 

q=l 

e  J^-v2 

J 

A 

0 

J: 

J 

0 

symmetr  ic 

i— J- 

II 

1 

a=l 

2  2 


We  may  simplify  the  above  matrix  by  choosing  tQ  such  that  it 
is  at  the  center  of  the  observation.  The  epochs  of  the 
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accumulation  periods,  t  ,  are  equally  spaced  and  t  -t  may  be 

M  **1 

written  as  t  -t  =  (q-q  ) At  ,  where  At  is  the  duration  of  an 
q  o  c  a  a 

accumulation  period  (typically  2  sec)  and  qc  is  the  index 
corresponding  to  fcQ;  qc  is  not  necessarily  an  integer.  If 
the  epoch  of  the  observation  is  taken  as  the  center  of  the 


will  be  given  by 


T  -IT 
[ (A  a)  A1 ] 


col 


m 


J  2  J  "| 

Q  l  Au./DET  -  Q  l  (Aw./DET)  Aw. 
j=l  3  j=l  3  * 

J 

-Q  l  (Aw-i/DET)  +  (QJ/DET)  Aw,, 
j  =  l  3  K 

3  2  -1 
t  ^UjQfQ'  -D/121  uk(m-qc)Ata 

(4. 1.1. 6) 


for  the  phase  from  the  accumulation  period  with  epoch 

Atg  relative  to  the  center  of  the  observation  and  at 
frequency  wk .  (m  denotes  the  sequential  number  of  the 
accumulation  period,  i.e.  m  is  a  specific  value  of  the  index 
q;  see  Equation  (4. 1.1.2)  and  Appendix  B  for  more  details.) 

We  should  note  that  the  expression  (4. 1.1. 6)  is  only 
dependent  on  the  form  of  the  mathematical  model  used  to 
express  the  cross  spectral  phases  as  functions  of  the  group 
delay,  phase  delay  rate  and  visibility  phase.  We  may  now 
find  the  relationship  between  the  group  delay,  phase  delay 
rate  and  visibility  phase  estimates  for  the  ionospheric 
signature  in  the  cross  spectral  phases  given  by  Equation 
(4. 1.1.1) . 


The  ionospheric  delay  contributions  to  the  values  of  the 
VLB  I  observables  may  be  obtained  by  performing  the  multipli¬ 
cation  given  in  Equation  (4. 1.1. 5)  using  Equations  (4.1.1. 1) 


and  (4. 1.1. 6).  The  product  is 


-KAN°[(  l  Au>!?)(  l  l/u.k)  -(  l  Aw.  )(  l  Ao)k/t»)k)] 
ii  _  j  =  l  k=l  j=l  J  k=l 

♦t  '  - - 3 — : - r - : - 

~  (.LAuj) 

3=1  J  3=1  J 


(4. 1.1. 7. a) 


- l 

= 


-KaNf  t-(  J  l/«k)  +  J  I  4vV 

3=1  k=l  k=l 

J  2  J  2 

J  I  Au>  ■  -  (  I  Aw^  ) 

j=l  3  j=l  3 


.  l  _ 
TP 


-KAN  .  J 


°  2 

ill 


(4. 1.1. 7. b) 


(4. 1.1. 7.c) 


where  Xg  and  Tp  are  the  contributions  to  the  visibility 
phase,  the  group  delay  and  the  phase  delay  rate,  respective¬ 
ly,  from  the  ionospheric  delay  and  rate. 

We  should  notice  that  the  delay  estimates  and 

t  g 

depend  only  on  the  integrated  electron  content  at  the  eooch 
of  the  observation  and  not  on  the  rate  of  change  of  the 
integrated  electron  content.  Similarly  the  phase  delay  rate 

*  i 

estimate,  x*,  depends  only  on  the  rate  of  change  of  the  inte¬ 
grated  electron  content.  This  decoupling  of  the  delay  and 
rate  estimates  follows  from  the  choice  of  the  epoch  of  the 
observation  to  be  at  the  center  of  the  data  duration. 


We  may  rewrite  Equations  (4. 1.1.7)  as 


it  =  -KAN?/<4ff 


(4. 1.1. 8. a) 


•  \"  •  *•  *  «  *•  .  .  •  *„*  %  .*»•  •  *»'  *  * 
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Tg  *  +KAN?/(w|ff)2  (4. 1.1. 8. b) 

A 

*p  =  -KAN./(w*ff)2  (4. 1.1. 8. c) 

where  “eff  and  “eff  are'  respectivley ,  the  effective 

frequencies  of  the  visibility  phase,  the  group  delay  and  the 
phase-delay  rate  estimates  for  calibrating  the  ionospheric 
delay.  By  equating  Equations  (4. 1.1.7)  and  (4. 1.1.8)  we  may 
determine  analytic  expressions  for  the  effective  frequencies 


w 


eff 


<7  9  <7  2 

J  I  Auj  -  (  J  AwJ 
3*1  J  W  J _ 


o  2  °  u  u 

I  Aw.  J  (1/w.  )  -  l  Aw.  £  Aw.  / w. 
j  =  l  -'k*!  K  j  =  l  -,k  =  l  K  K 


(4. 1.1. 9. a) 
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3 _ 3  =  1.  .3 


W  U  U 

l  Aw.  £  (1/w.  )  -  J  l  Aw  /w. 
j=l  Jk=l  K  k=l  K  K 


1/2 


(4. 1.1. 9. b) 


“eff 


-  (.1  «j/J  )W2 

j  =  l  3 


(4. 1.1. 9. c) 


In  Section  4.2  when  we  are  developing  the  phase  delay 
prediction  techniques,  we  will  be  using  phase  delays  rather 
than  the  visibility  phases.  We  may  easily  obtain  the  effec¬ 
tive  frequency  of  this  delay  measure  from  the  above 
equations.  The  contribution  of  the  ionospheric  delay  to  the 
phase-delay  estimate  will  be 
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and  hence  from  Equation  (4. 1.1. 7. a)  we  have  that 


In  terms  of  effective  frequencies,  the  equivalent 
expression  to  the  expression  given  above  is 

Tp  =  -KAN°/(w£ff)2  (4.1.1.10) 

and  therefore  by  comparing  the  above  two  equations  we  have 


The  expressions  for  effective  frequencies  do  have  a 

simple  interpretation  which  is  shown  in  Figure  4. 1.1.1.  The 

group  delay  effective  frequency  is  the  frequency  at  which  the 

tangent  to  the  phase  versus  frequency  curve  of  the  dispersive 

medium  is  parallel  to  the  straight  line  determined  by  the 

least  squares  fit  to  the  discretely  sampled  points  on  the 

phase  versus  frequency  curve.  (Each  of  the  points  shown 

represents  an  average  of  the  phase  across  the  4  MHz  recorded 

bandwidth.  It  can  be  easily  shown  that,  for  an  integrated 

18  2 

electron  content  of  10  electrons/m  ,  the  curvature  of  the 
phase  verus  frequency  curve,  at  2.3  GHz,  deviates  from  a 
straight  line  by  a  maximum  of  0?016.  For  our  purposes  we  can 
treat  each  channel  as  a  discrete  frequency.  The  slope  of  the 


Figure 


4. 1.1.1  Graphical  representadon  of  the  phase  and  group  delay 
effective  frequencies. 


phase 


frequency 


phase 


Ionospheric  delay 
dispersion  curve 


phase  across  the  recorded  bandwidth  is  absorbed  into  the 

single  band  delay  estimate;  see  Appendix  B,  Equation  (B.3.3), 

for  discussion.  If  we  wished  to  account  for  the  curvature  of 

the  phase  across  the  recorded  bandwidth,  we  could  extend 

Equations  (4. 1.1.2)  and  (4. 1.1. 3)  to  include  the  singleband 

delay.  At  this  time,  such  an  extension  does  not  seem 

warranted.)  The  phase  delay  effective  frequency  is  the 

frequency  at  which  a  straight  line  from  the  origin  to  the 

least  squares  estimate  of  the  visibility  phase  intersects  the 

phase  versus  frequency  curve  for  the  ionospheric  delay 

dispersion.  (We  should  note  that  the  curvature  of  the  phase 

versus  frequency  curve  is  very  exaggerated  in  Figure  4. 1.1.1. 

The  deviation  of  the  curve  from  a  straight  line,  at  2.3  GHz, 

over  the  spanned  bandwidth  of  75  MHz,  is  less  than  1°,  for  an 

18  2 

integrated  electron  content  of  10  electrons/m  .  We  could 
attempt  to  estimate  the  dispersive  delay  and  the  non-dispers- 
ive  delay  from  a  single  frequency  band,  but  these  estimates 
would  be  highly  correlated  and  have  very  low  precision.  (For 
the  frequency  sequence  2.295  GHz,  2.270  GHz,  and  2.220  GHz, 
and  phase  measurements  of  with  standard  deviations  of  1°  in 
each  channel,  the  non-d i sper s i ve  delay  estimate  would  have  a 
standard  deviation  of  6.2  nsec.  If  the  dispersive  delay  is 
not  estimated,  the  standard  deviation  of  the  group  delay 
would  be  >0.05  nsec.  Clearly,  we  can  not  separate  the 
dispersive  and  the  non-dispersive  delays  with  observations 
from  only  one  frequency  band.) 

We  are  now  able  to  develop  an  algorithm  which  will  allow 
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us  to  estimate  the  contribution  of  the  dispersive  delay  to 
the  group-delay  observations  by  using  observations  of  the 
delays  at  two  widely  separated  frequency  bands.  Equations 
(4. 1.1. 8  a-c)  and  Equation  (4.1.1.10)  give  the  contributions 
to  the  observed  delay  and  rate  values  of  a  plasma  with  a 
difference  integrated  electron  density  of  AN?  and  rate  of 
change  of  the  integrated  electron  density  of  An^.  We  will 
new  use  these  relationships  to  develop  an  algorithm  which 
will  allow  us  to  eliminate  the  contributions  of  the  plasma 
delay  and  rate. 

We  will  assume  that  observations  of  the  group  delay, 
phase  delay  and  phase  delay  rate  are  available  at  two  widely 
separated  frequencies.  For  the  Mark  III  system  these  fre¬ 
quencies  are  X-band  (»  8.4  GHz)  and  S-band  (=  2.3  GHz) .  We 

will  denote  these  observations  by  t*,  t*  and  f*  for  the 

2  g  p  p 

X-band  group  delay,  phase  delay  and  phase  delay  rate, 

respectively,  and  t®,  and  t®  for  the  same  quantities  at 

y  h  I? 

S-band.  Normally  the  phase  delay  observations  will  be 
ambiguous  and  would  not  be  used  in  the  analysis.  In  Section 
4.3  we  will  discuss  the  possibilities  for  the  elimination  of 
their  ambiguities.  For  completeness  we  will  develop  the 
algorithms  for  the  elimination  of  the  plasma  contributions  to 
these  delays  at  this  time. 

We  now  wish  to  decompose  each  observed  delay  and  rate 
into  two  parts  -  a  contribution  from  all  of  the  non-disper- 
sive  delays  (i.e,  the  geometry,  clock  epoch  offsets  and  the 
neutral  atmosphere),  -cn,  and  a  contribution  from  the  dispers- 
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ive  plasma  delay.  For  the  delay  observations,  we  will  also 
need  to  include  a  constant  additive  offset  between  each  delay 
measure  to  account  for  (the  nominally  constant)  instrumental 
dispersions  which  are  not  calibrated  by  the  phase  calibration 
system.  (See  Chapter  3  for  details.)  These  offsets  allow  for 
the  delays  at  X-band  and  S-band  to  be  different  even  in  the 
absence  of  a  dispersive  propagation  medium.  An  offset 
between  the  group  and  the  phase  delays  must  also  be  included 
because  of  instrumental  dispersions  (see  Section  3.1  which 
discusses  some  of  the  dispersive  properties  of  feed  horns  and 
waveguides) . 

We  will  now  develop  the  relationships  between  the  delay 
measurements,  the  non-dispersive  delay  and  the  dispersive 
delay.  Equation  (4. 1.1. 8. b)  gives  the  contribution  of  the 
ionospheric  propagation  delay  to  the  observed  group  delay 
values.  If  we  include  the  non-dispersive  delay,  the  observed 
group  delay  value  at  X-band,  ,  will  be 

Tg  =  xn  +  KaN°/(w*)2  (4.1.1.12) 


where  w*  is  the  effective  frequency  of  the  X-band  group 

.  X  Cl  , 

delays  (i.e.,  u,g  =  Wgff  computed  for  the  X-band  frequencies 
using  Equation  (4 . 1 . 1 . 19. b) .  We  will  denote  KAn°/(w*)2  by  1^ 
--  the  ionospheric  contribution  to  the  X-band  group  delays. 
Equation  (4.1.1.12)  can  be  written  as 


(4.1.1.13) 
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The  S-band  group  delay  measurements,  will  be  given  by 

s  ,  ,  x,  s.2,  ^s 

T  =  T+  ( U)  /to  )  I  +0 
g  n  q/  g'  x  g 

s 

where  is  the  effective  frequency  of  the  S-band  group  delay 
observations  and  o|  is  a  delay  offset  which  accounts  for  the 
difference  in  propagation  delay  of  the  X-band  and  S-band 
signals  through  the  antenna,  feed  horns  and  waveguides  up  to 


the  injection  point  of  the  phase  calibration  pulses.  We  will 


x  s  2 

denote  (ojg/Wg)  by  P  which  yields 

TS  =  x  +  pi  +  0S 
g  n  x  g 


(4.1.1.14) 


The  relationship  for  the  X-band  phase  delays,  t*', 
will  be  (assuming  right  circular  polarization  (RCP) ) 

Tp'  =  Tn  ~  aIx  +  °p  +  Ui“*2,/wx 

where  a  is  (w*/w*)2,  0*  is  an  offset  between  X-band  group 
delays  and  the  X-band  phase  delays,  and  <l>2  are  the  posi¬ 
tion  angles  of  the  feed  horns  at  each  site  of  the  baseline 
(see  Appendix  D) ,  and  u*  is  the  radio  frequency  of  the  visi¬ 
bility  phase  at  X-band.  (u>x  is  equivalent  to  a>Q  in  previous 
derivations  in  this  thesis.  We  change  its  symbol  now  to 
differentiate  between  X-band  and  S-band  observations.)  The 
analytic  expressions  for  ^  and  <i>2  are  derived  in  Appendix  D. 

We  will  redefine  the  phase  delays  such  that  they  will  be 
referred  to  a  reference  orientation  which  is  fixed  with 

X 

respect  to  inertial  space.  This  new  phase  delay,  will  be 

given  by 

'p  =  Tp'  - 


•; '  -V<  V*V-\>V«V  V ~_V V 


The  expression  for  these  new  phase  delays  will  be 


Tp  =  xn  -  aIx  +  °p 


(4.1.1.15) 


The  S-band  phase  delays  (again,  referred  to  a  constant  refer¬ 
ence  orientation)  will  be 


s  s 

x  -  x  -  Y I  +0 
p  n  1  x  p 


(4.1.1.16) 


x  s  2  s 

where  y  =  (u>  /w  )  ,  to  is  the  effective  frequency  of  the 
y  y  h 


.s  . 


S-band  phase  delays  and  0^  is  the  offset  between  the  S-band 


phase  delays  and  the  X-band  group  delays  in  the  absence  of 
propagation  through  a  dispersive  medium,  aside  from  the 
antenna,  etc. 

If  we  examine  Equations  (4.1.1.12)  through  (4.1.1.15), 
we  see  that  for  each  epoch  there  are  four  measured  quantities 
(if  we  include  the  phase  delays),  and  five  unknowns.  If  only 
the  group  delays  are  considered  there  will  be  two  measure¬ 
ments  and  3  unknowns.  With  each  additional  pair  of  group 
delays  there  are  two  new  unknown  quantities  introduced,  t 


and  Ix,  for  this  new  epoch.  Hence  when  only  the  group  delay 


measurements  are  used,  there  is  no  redundancy.  If  both  the 
group-  and  phase-delay  measurements  are  used,  at  each  addi¬ 
tional  epoch,  four  new  measurements  are  made,  but  only  two 
new  unknowns  are  introduced.  With  observations  from  more 
than  one  epoch,  redundant  information  is  obtained  and  we  may 
use  this  additional  information  to  check  the  quality  of  the 


group-delay  measurements. 

We  will  now  derive  the  algorithms  which  are  used  to 
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estimate  i  and  I  from  the  various  delay  measures.  If  only 

the  group  delays  are  available  we  have  two  equations. 

Equations  (4.1.1.12)  and  (4.1.1.13),  which  can  be  used  to 

s 

estimate  a  linear  combination  of  in,  Ix  and  0g.  Examination 
of  these  equations  yields 

!x  =  (t£-t|)/(1-P)  +  Oj/d-f)  (4.1.1.16) 

If  we  assume  that  0s  is  constant,  but  unknown,  we  see  that 
Equation  (4.1.1.16)  yields  an  estimate  of  the  ionospheric 
delay  plus  a  constant.  Without  an  independent  estimate  of 
either  Ix  or  0g  we  can  not  uniquely  separate  these  two 
quantities.  This  constant  additive  contribution  to  the 
ionospheric  delay  estimate  (if  it  is  indeed  constant)  is  not 
a  severe  problem  because  it  will  be  absorbed  into  the  esti¬ 
mated  clock  offsets. 

An  estimate  of  the  non-dispersive  delay  may  be  obtained 
by 

xn  =  (Px*-x®)  /  0-1)  -  o|/(l-0)  (4.1.1.17) 

Again  we  see  that  we  cannot  separate  uniquely  the  neutral 
delay  from  the  offset  Og/(l-P). 

When  only  the  group  delays  are  used  we  have  no  means  of 
assessing  the  constancy  of  OgS  without  assuming  some  proper¬ 
ties  of  either  tn  and  Ix« 

However,  if  the  phase  delays  are  available  we  can  make 
some  statements  concerning  the  constancy  of  the  offsets  given 
in  Equations  ( 4 . 1 . 1 . 13 ) - (4 . 1 . 1 . 1 5 )  .  Using  the  same  approach 
which  led  to  Equation  (4.1.1.16),  we  have,  from  Equations 
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(4.1.1.14)  and  (4.1.1.15)  that 

lx  S  (Tp~Tp)/(Y-a)  -  (Op-O®) / (y-a)  (4.1.1.18) 

and 

xn  =  ( yXp-atp)  /  (y-<i)  +  (yOp-aOp)  /  (y-a)  (4.1.1.19) 


If  we  now  compare  Equations  (4.1.1.16)  and  (4.1.1.18), 
we  see  that  these  equations  provide  two  estimates  of  the 
ionospheric  delay  which  should  differ  by  only  a  constant 
t-0®/ (1-P) - (0*-0®) /(y-o) ] .  In  Section  4.3  we  will  examine 

y  r  r 

the  behavior  of  this  difference  from  several  VLBI  experiments 
to  ascertain  the  constancy  of  these  offsets.  We  will  resume 
this  discussion  in  Section  4.3  and  investigate  methods  which 
can  be  used  to  determine  which  of  the  offsets  are  varying  if 
we  do  see  a  non-constant  difference  between  the  group-delay 
and  the  phase-delay  estimates  of  the  ionospheric  delay. 

The  estimates  of  the  non-dispersive  delay  will  be  used 
in  the  weighted  least  squares  analyses  to  be  discussed  in 
Chapter  5.  The  weight  given  to  each  measurement  in  these 
analyses  will  be  inversely  proportional  to  the  variance  of 
the  measurement  (see  Appendix  E  for  more  details) .  We  may 
easily  calculate  the  variance  of  the  non-dispersive  delay 
estimates,  given  the  the  variances  of  the  group-delay 


measurements,  at  X-  and  S-band,  from  Equation  (4. 1.1. 7).  The 

x  2  s 

variance  of  ,  (cn>  ,  will  be  (assuming  that  0^  is  constant) 


K)2  =  (P2(og)2  +  <0g>2)/(P-l> 


(4.1.1.20) 


x  2  s  2 

where  (n)  and  («  )  are  the  variances  of  the  X-  and  S-band 


group  delays,  and  we  have  assumed  that  the  X-  and  S-band 

group  delay  measurements  are  uncorrelated  (see  Section  2.1). 

Similarly,  the  variance  of  the  ionospheric-group-delay  esti- 
x  s  2 

mate,  (o  )  ,  will  be 

(<02  =  (<°a)2  +  (aa}  2)/(P-D  2  •  (4.1.1.21) 

yy  y  y 

We  will  investigate  these  variances  in  more  detail  in  Section 
4.2,  when  we  will  need  to  know  the  precision  of  the  phase- 
delay  predictions. 

We  have  now  derived  all  of  the  theoretical  relationships 
which  are  necessary  for  us  to  be  able  to  predict  the  phase 
delays  from  the  group  delays.  While  deriving  these  formulas 
we  seemed  to  make  assumptions  in  every  step  of  the  deriva¬ 
tions.  Therefore  before  proceeding  any  further  we  should 
evaluate  the  possible  magnitudes  of  the  neglected  terms. 


4.1.2  Accuracy  of  the  ionospheric  correction 


In  the  preceding  sections  of  this  chapter  we  have 
developed  a  model  for  the  frequency  dependence  of  the  ion¬ 
ospheric  delay  for  both  the  group  delays  and  the  phase 
delays.  While  deriving  these  results  (Equations  (4.1.8)  and 
(4.1.11)),  we  made  a  number  of  approximations  and  neglected 
several  terms.  In  this  section  we  will  investigate  the 
magnitude  of  the  effects  of  these  approximations  and 
neglected  terms. 

In  Table  4. 1.2.1  we  summarize  the  terms  which  were 
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Table  4. 1.2.1  Terms  neglected  in  the  derivation  of  the  dual 


frequency  ionospheric  delay  correction 


Neglected  term 


Symbol  used  in  text 


A.  Magnetic  field  term 

«£ 

4Im 

B.  Damping  term  (due  to  collisions) 

C.  Higher  order  terms  in  the 

the  refractive  index 

expansion 

D.  Ray  integral  through  the 

ionosphere  -  bending 

E.  Ray  integral  through  the 

ionosphere  -  scintillation 

•s 

•2 

★ 

These  symbols  are  used  to  denote  the  contribution  of  each  of 

these  terms  to  the  observed  delays.  The  superscripts  p  and  g 

denote  contributions  to  the  phase  delay  and  group  delay, 

respectively. 

neglected  in  the  derivations  in  Section  4.1.  We  will  now 
analyze  each  of  these  terms  and  try  to  place  upper  bounds  on 
their  contributions  to  the  ionospheric  delay  corrections. 

Each  term  has  been  given  a  letter  code  which  will  be  used 
when  the  magnitude  of  each  term's  contribution  is  listed  in  a 
summary. 

We  will  study  each  of  the  neglected  terms  in  the  order 
in  which  they  are  listed  in  Table  4. 1.2.1.  At  the  end  of 
this  section  we  will  tabulate  the  effects  of  each  of  the 
neglected  terms  for  ease  of  analysis. 

The  first  approximations  we  made  were  in  Equation 
(4.1.2)  when  we  neglected  the  effects  of  the  magnetic  and 
damping  terms.  We  will  firstly  consider  the  magnetic  term. 
When  an  external  magnetic  field  is  applied  to  a  plasma,  the 
plasma  becomes  birefringent ,  i.e.  the  phase  velocity  of  an 
electromagnetic  wave  propagating  through  the  plasma  depends 
on  the  wave's  polarization.  For  circularly  polarized  radia¬ 
tion  the  dielectric  constant  of  the  plasma  can  be  written  as 
(Jackson  1975,  p.  293) 

e+  =  eQ(l  ~  Wp/(w^±  uiUp)  )  (4. 1.2.1) 

where  the  upper  sign  refer  to  right  circular  polarization 
(IEEE  definition)  and  the  lower  sign  to  left  circular  polar¬ 
ization  and  wb,  the  precession  frequency,  is  given  by  ( MKS 
units) 

“b  =  lel  ®o’k/me  ~  S_1  for  IV*I  ~  0.5X10-W 

where  K  is  a  unit  vector  in  the  direction  of  propagation. 


Equation  (4. 1.2.1)  is  an  aprox imation  because  terms  of  the 
2 

order  of  u)b  have  been  neglected  during  its  derivation.  This 

2 

approximation  is  adequate  for  our  purposes  because  the  wb 
terms  are  small  compared  to  the  terms  which  are  small 
compared  to  the  primary  term. 

We  will  restrict  the  following  discussion  to  right 
circular  polarization.  Equation  (4. 1.2.1)  may  be  expanded  in 
Taylor  series,  which  yields, 

e+  *  eo(1  “  (u)p/a)?  +  (wpub/a)3)  ) 

The  two  leading  terms  in  the  expression  for  the  dielectric 

constant  are  the  terms  which  we  have  already  considered 

(Equation  (4.1.5)).  The  contribution  to  the  dielectric 

constant  from  the  magnetic  field,  Ae™,  will  be 

.  m  ,  2  ,3 

Ae+  * 

and  the  resultant  error  in  the  phase  refractive  index.  An 
will  be 

An™  =  WpO)b/(2u>3)  (4. 1.2. 2) 

To  determine  the  additional  phase  delay,  Ai^,  due  to  the 
magnetic  field  we  integrate  An™/c  through  the  ionosphere. 
Hence 

sq 

Al£  =  /  <dpU>b/  ( 2 oj 3 )  ds 
0 

If  we  now  substitute  for  Wp  we  obtain 


+  3 


will  in  general  be  a  function 


The  precession  frequency, 
of  s  because  of  the  variation  of  §Q  with  altitude  and  loca¬ 
tion.  However,  we  are  interested  in  only  bounding  the  magni¬ 
tude  of  this  term  and  we  will  assume  that  B  .fc  =|B  I  at  the 

o  1  o 

Earth’s  surface.  If  we  assume  is  constant  at  its  maximum 
value,  u>™,  the  contribution  of  the  magnetic  field  term  to  the 
phase  delay  will  be  (from  Equation  (4. 1.2. 3)) 


e2w™  N . 

D  1 


2cm  ew' 
e  o 


(4. 1.2. 4) 


We  may  now  determine  the  corresponding  bound  on  the 
group  delay  error,  AI^,  by  multiplying  Equation  (4. 1.2.4)  by 
w  to  find  the  phase  error  and  then  by  differentiating  the 
phase  error  with  respect  to  angular  frequency, 

e2w!"  N. 

lMn!l  <  - 5 — k  (4. 1.2. 4) 

'  cm  e  u)3 

e  o 

where  the  AI^  and  AI^  will  have  opposite  signs. 

To  evaluate  w™,  we  will  use  the  dipole  approximation  of 

the  Earth's  magnetic  field  which  accounts  for  95%  of  the 

total  field  strength.  The  dipole  moment  of  the  Earth's 

22  2 

magnetic  field  is  7.94x10  Am  which  yields  a  magnetic 

induction  at  the  antenns  site  of  (Stacy  1977,  pp.  211-212) 

-5  2  1^2  _2 

| B  |  =  3.07x10  (l+3cos*6)  Wb  m  4 

where  6  is  the  angle  between  the  dipole  axis  and  the  radius 

vector  to  the  site. 

At  mid-latitude  sites  J §o |  will  be  approximately 


0.5x10  Wb  m  which  yields  a  value  of  8.8x10  s  for  u.. 


If  we  take  an  extreme  value  for  the  integrated  electron 


18  2 

content  of  10  electrons/m  ,  the  magnitude  of  the 


contribution  of  of  magnetic  field  term  would  be 


AIP|  =  0.3  psec, 

m 1 


| A Ij*  |  «  0.6  psec; 


at  8.4  GHz 


|AI 


m ' 


15.4  psec. 


|AI^|  =  -30.9  psec;  at  2.3  GHz 


These  errors  will  lead  to  errors  in  ionospheric  delay-cor¬ 
rected  group  and  phase  delays  of  magnitude  1.9  psec  and  0.9  • 
psec,  respectively.  These  errors  are  small  compared  to  other 
inadequacies  in  the  geodetic  models  for  the  group  delays. 

The  second  term  which  was  neglected  in  Equation  (4.1.2) 
was  the  damping  term.  When  the  damping  term  is  considered 
the  dielectric  constant  becomes  complex  (in  the  sense  of  real 
and  imaginary  components)  and  is  given  by  (Jackson  1975,  p. 
287) 


.  2 


e  =  eQ [1  +  iwp/w (y-iw) ] 


which  when  separated  into  real  and  imaginary  parts  becomes 


e  =  e_(l  - 


k) 


l  Yu 


V  ) 


2  2  2  2 
<o>  +Y  )  u(r  ♦«  ) 


(4. 1.2. 6) 


The  imaginary  part  of  Equation  (4. 1.2.6)  will  cause  a  loss  in 
the  energy  of  the  wave  as  the  wave  propagates  through  the 
plasma.  The  real  part  will  affect  the  propagation  velocity 
of  the  wave.  The  damping  constant,  Y,  is  a  phenomenological 
model  of  damping,  i.e.  a  parameterization  which  seems  to 
model  damping  in  materials.  The  damping  appears  to  be  due  to 
collisions  between  electrons  and  molecules  and  y  is  inter- 
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preted  as  the  collision  frequency  between  electrons  and 
molecules  (see  Jackson  1975,  p.  287-288). 

In  order  to  bound  the  magnitude  of  the  effect  of  the 

damping  term  we  will  use  a  value  of  the  collision  frequency 

in  the  lower  part  of  the  ionosphere  (=*85  km  altitude)  .  The 

collision  frequency  of  electrons  and  neutral  atoms  has  been 

calculated  by  Nicolet  (1959) .  At  85  km  altitude  Nicolet 

gives  a  value  of  approximately  10®  see"*.  The  theoretical 

values  of  Nicolet  for  different  altitudes  are  higher  (by  up 

to  a  factor  of  two) ,  than  the  measured  values  given  by  Kane 

(1959) .  (Kane  used  the  relative  absorption  of  two 

polarizations  of  7.75  MHz  continuous  wave  (CW)  signals 

transmitted  from  a  rocket  ascending  between  60  and  85  km.) 

More  recent  measurement  of  the  collision  frequency  between 

ions  and  neutral  atoms  based  on  incoherent  radar  scatter 

observations  (Schlegel  et  al . ,  1980;  Murdin  1981)  can  also  be 

used  to  deduce  electron-neutral  atom  collision  frequncies. 

At  85  km  altitude,  the  maximum  measured  value  of  ion-neutral 

4  -1 

atom  collision  frequency  was  6x10  sec  .  This  value  may  be 
converted  to  the  electron-neutral  collision  frequency  by  the 
relationship  (Schlegel  et  al . ,  1980) 


where  y  and  y.  are  the  electron-neutral  and  ion-neutral 
in 

collision  frequencies,  respectively,  mg  and  m^  are  the 

electron  and  ion  masses,  T  and  T.  are  the  electron  and  ion 

e  l 

temperatures  and  the  factor  0.357  accounts  for  the  difference 


in  effective  collisional  cross  section  of  the  electron- 
neutral  and  ion-neutral  collisions.  The  mass  and  temperature 
ratios  account  for  the  difference  in  velocity  of  the 

electrons  and  ions.  If  we  take  the  ratio  of  the  mass  of  the 

^  ^  4 

ions  (mainly  NO  and  Oj)  to  electrons  to  be  “6x10  and  Tg/Tj 

to  be  unity,  the  electron-neutral  collision  frequency 

corresponding  to  Yjn  =  6x10*  sec  ^  would  be  =5x10**  sec 

The  collisional  frequency  is  highly  variable  and  the 
difference  by  a  factor  of  five  between  this  value  and 
Nicolet's  value  is  not  unexpected.  Since  we  are  only  inter¬ 
ested  in  bounding  the  magnitude  of  the  effects  of  collisions 
we  will  use  the  higher  value  of  y  *  5x10**  sec’^. 

Now  that  we  have  some  knowledge  of  the  magnitude  of  y  we 
may  return  to  the  analysis  of  Equation  (4. 1.2. 6).  Since  y  is 
small  compared  to  u>  we  will  again  expand  the  real  part  of 
dielectric  constant  in  series  which  yields 

e  *  eQ(l  “  (“p/“2)  +  (WpY2/w4) ) 


The  contribution  of  the  collision  frequency  will  lead  to  an 
error  in  the  phase  refractive  index,  Anc,  of 

AnC  =  WpY2/(2w4)  (4. 1.2. 7) 

We  will  now  assume  that  the  collision  frequency  remains 
constant  throughout  the  ionosphere  (this  assumption  will  lead 
to  a  large  over-estimation  of  its  effect) .  Integration  of 
Anc/c  through  the  ionosphere  leads  to  a  phase  delay  error  of 


2 

e 

2  cm  e 


(4. 1.2. 8) 


(4. 1.2. 9) 


and  a  group  delay  error  of 


■  - 


3e4 


_  jj  1 _ 

2cm  e  Ni  4 
e  o  u> 


Substitution  into  Equations  (4. 1.2. 8)  and  (4. 1.2. 9)  leads  to 
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(assuming  =  10  electrons/m  ) 


AI?  - 

1.7xl0-5 

psec , 

-S.lxlO-5 

psec ; 

at 

** 

• 

CO 

AIP  = 
c 

0.003 

psec, 

Aig  = 
c 

-0.009 

psec; 

at 

2.3 

These  errors  are  negligible. 

The  next  term  we  neglected  was  the  second  order  terms  in 


2  2. 

the  expansion  of  the  square  root  of  (l-Up/u  )  in  Equation 


(4.1.7).  When  the  second  order  terms  are  included  the  phase 
refractive  index  becomes 


np  =  1  -  (a)p/2u)?  +  (w£/u>2)2/8 


(4.1.2.10) 


The  error  in  the  phase  delay  due  to  the  neglect  of  the  second 
order  term  will  be 


■"e'o 


I  <  ds 


4  J  v 
8cw  0  v 


(4.1.2.11) 


To  compute  an  approximate  value  of  AIPr  we  will  adopt  the 
simplified  profile  of  the  electron  density  content  shown  in 
Figure  4. 1.2.1  .  We  assume  that  the  electron  density 
increases  linearly  with  decreasing  altitude  from  hm  to  h^  (or 
equivalently  with  increasing  distance  along  the  ray  path  from 
0  to  s ^ )  until  a  maximum  volumetric  density  of  N^ax  is  reach¬ 
ed.  The  electron  density  then  linearly  decreases  until 


altitude  hQ  is  reached  (equivalently ,  distance  sQ) .  The 
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total  integrated  electron  content  for  this  model  will  be 

Ni  =  CV2  * 

In  terms  of  the  total  integrated  electron  content,  we  find 


Substitution  of  this  result  into  Equation  (4.1.2.11)  yields 
for  the  phase  delay  error 


«£  ■ 


5  5  I 
6m  t  cs  a) 
e  o  o 


The  group  delay  error  will  be 


(4.1.2.12) 


AI 


g 

e 


q4N? 


-22  4 

2mecSoj 
e  o  o 


(4.1.2.13) 
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If  we  take  Ni*10  electrons/m  and  the  thickness  of  the 
ionosphere  (sQ)  to  be  800  km,  the  contribution  of  the  second 
order  expansion  term  to  the  X-  and  S-band  phase  and  group 
delays  will  be 


«!- 

9.xl0-4 

psec, 

AlJ  =  -3.xl0"3 

psec ; 

at  8.4  GHz 

«!• 

CM 

• 

O 

psec. 

Al^  »  -0.5 

psec? 

at  2.3  GHz 

These  errors  are  insignificant.  Even  if  the  thickness  of  the 
ionosphere  were  reduced  by  a  factor  of  10  (to  80  km)  the 
contribution  of  this  neglected  term  would  still  be  small. 

The  next  inadequacy  in  our  model  for  the  ionosphere 
delay  was  the  ray  integrals  given  in  Equation  (4.1.9).  There 
are  several  inadequacies  in  this  equation  which  we  will 


discuss.  These  inadequacies  are  1)  the  line  integrals  are 
along  the  ray  paths  through  the  ionosphere.  Since  the  iono¬ 
sphere  will  refract  the  radiation,  the  paths  of  integrations 
will  be  different  for  the  two  frequencies  used.  Hence  for 
the  two  frequencies,  the  integrated  electron  contents  could 
be  different;  2)  the  concept  of  a  ray-path  integration  will 
be  inadequate  if  there  are  inhomogeneities  in  the  ionosphere 
on  scales  smaller  than  the  Fresnel  zone  diameter  of  radiation 
from  the  height  of  the  ionosphere.  We  will  now  discuss  these 
errors  in  the  order  listed. 

The  difference  in  the  integration  paths  can  be  treated 

within  the  framework  which  we  have  already  developed.  We 

will  assume  that  the  ionosphere  is  a  plane  stratified  medium 

(this  assumption  should  be  adequate  since  we  are  studying  an 

error  which  should  be  small;  if  the  error  is  large  then  we 

should  re-evaluate  the  plane  stratified  assumption) .  Figure 

4. 1.2. 2  shows  our  idealized  model  of  the  layered  ionosphere. 

For  a  plane  stratified  medium  the  angle  between  ray  direction 

before  entering  the  medium  and  the  ray  direction  in  a  layer 

with  refractive  index  ^(s)  will  be  given  by  Snell's  law 

sin  0^/sin  0E(s)  *  np(s)/n0  (4.1.2.14) 

where  n„  is  the  refractive  index  in  the  region  outside  the 
o 

medium  (for  our  purposes  we  can  take  nQ  to  be  unity) .  As  we 
have  seen  already  n  ( s )  deviates  from  n  by  only  a  small 

r  w 

2  2 

amount,  (»u>p/2u)  ),  and  hence  we  will  expand  Snell's  law  in 
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approximately 

A0  (s)  “  -tan  9  ^  Anp(s)  (4.1.2.15) 

where  Anp(s) =np(s)-nQ.  Equation  (4.1.2.15)  allows  us  to 
deduce  both  the  deviation  of  the  path  through  the  ionosphere 
from  a  straight  line  and  the  additional  path  travelled  by  the 

radiation  because  of  the  ray  curvature.  The  offset,  AO, 

between  the  actual  path  travelled  by  the  radiation  and  a 
straight  line  path  through  the  ionosphere,  will  be 

S- 

o  o 

Ao  3  /  A0  ds  *  -  tan  9.  /  An  ds  (4.1.2.16) 

0  1  0  p 

The  integral  in  Equation  (4.1.2.16)  has  already  been  eval¬ 
uated.  This  integral  is  simply  the  ionospheric  phase  delay 

multiplied  by  the  speed  of  light.  For  an  integrated  electron 
18  2 

content  of  10  electrons/m  ,  the  deviations  of  the  S-band 
signal  from  a  straight  line  would  be  AO  “  7.5  tan  6^  m. 

This  offset  would  appear  to  grow  very  large  for  low 
elevation  observations  (i.e.,  9^-90°).  This  behavior  is, 
however,  an  artifact  of  the  plane  approximation  which  was 
used  to  derive  Equation  (4.1.2.16).  When  we  consider  a 
curved  Earth  (Figure  4. 1.2.3)  an  observation  at  0°  elevation 
angle  at  the  surface  of  the  earth  (neglecting  tropospheric 
refraction)  would  impinge  on  a  spherical  layer  hm  above  the 
Earth's  surface  at  a  zenith  angle  of 
e°  =  sin-1  (Re/IVV) 

where  Fe  is  the  radius  of  the  Earth.  If  we  take  hm  to  be  300 
kms,  9°  would  be  73  deg.  The  maximum  deviation  of  an  S-band 
path  from  a  straight  line  would  then  be  =*25  m  in  this  case. 


Geometry  of  impinging  angle  for  a  spherical  ionospheric 
layer 


rotation  axis  of  the  Earth 
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If  large  variations  in  the  electron  content  occur  on  scales 
of  25  meters,  then  the  integrated  electron  content  would  be 
different  for  the  two  frequencies.  However  if  this  variation 
were  to  occur,  then  the  ray  approximations  we  have  been  using 
thus  far  would  no  longer  be  adequate.  We  will  continue  this 
discussion  when  we  discuss  the  adequacy  of  the  ray  trace 
method . 

A  second  consequence  of  the  path  of  the  radiation  being 
deflected  as  it  propagates  through  the  ionosphere  is  that  the 
path  length  is  increased.  The  contribution  to  the  phase 
delay  from  the  additional  path  length  due  to  ray  bending  will 
be  approximately 

sq 

AlJjJ  =  (1/c)  £  A62/2  ds 

which,  by  substituting  Equation  (4.1.2.14)  into  the  above 
equation,  becomes 


If  we  refer  back  to  Equations  (4.1.2.10)  and  (4.1.2.11)  we 


see  that 

AlP  =  tan29.  AIP  (4.1.2.17) 

b  i  e 

where  AIP  is  the  phase  delay  error  due  to  the  second  order 
terms  in  the  expansion  of  Op  (see  Equation  (4.1.2.12)).  The 
group-delay  contribution  of  the  bending  term  will  be 


Alg  =  tan2ei  |Ai||  (4.1.2.18) 

For  an  observation  at  0°  elevation  on  the  surface  of  the 
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earth  and  for 

an 

integrated 

electron  content  of 

1018 

2 

electron/m  , 

the 

contribution  of 

the  additional 

path  length 

will  be 

-  «• 

003 

psec. 

4Ib 

=  0.009  psec; 

at  8.4 

GHz 

“g  *  !• 

7 

psec, 

«g 

=  5.0  psec; 

at  2.3 

GHz 

Again  these  contributions  are  small. 

We  will  now  study  the  second  aspect  of  the  ray  path 
integration  --  scintillation.  We  calculated  in  an  earlier 
part  of  this  section  that  the  ray  path  of  the  S-band  signals 
could  deviate  from  a  straight  line  by  up  to  25  m.  If  large 
inhomogeneities  in  the  plasma  density  are  present  on  scales 
of  25  m,  then  the  use  of  a  ray  to  represent  the  propagation 
of  radio  wave  is  invalid.  We  will  now  investigate  the 
effects  of  small  scale  ionospheric  inhomogeneities  on  the 
propagation  of  radio  waves. 

There  are  two  methods  we  could  use  to  study  the  effects 
of  scintillation  on  VLBI  observations.  One  is  to  use 
observations  of  scintillation  on  satellite  signals  and  the 
other  is  a  theoretical  approach.  Because  we  are  interested 
in  bounding  the  effects  of  scintillation  we  will  adopt  the 
former  aproach.  (The  reader  is  referred  to  the  works  of 
Booker,  1975;  Costa  and  Kelly,  1976;  Rino,  1979a;  Rino, 

1979b;  and  Bogusch  et  al . ,  1981,  for  discussions  of  the 
theoretical  methods  of  studying  scintillation.  These  works 
are  primarily  concerned  with  studying  the  ionosphere  using 
scintillation  observations  which  is  the  inverse  of  our  aim  in 


this  study.)  The  most  extensive  study  of  scintillation  of 
satellite  radio  transmissions  has  been  made  by  Fremouw  et  al. 
(1978) .  This  work  studied  the  phase  and  amplitude  fluctua¬ 
tions  of  10  mutually  coherent  continuous  wave  (CW)  signals 
transmitted  from  Defense  Nuclear  Agency  (DNA)  satellite  P76 
-5.  The  ten  CW  signals  were  distributed  in  frequency  between 
VHF  and  S-band .  These  experiments  confirm  that  the  frequency 
dependence  of  the  root-mean-square  (RMS)  phase  fluctuations 
obey  the  inverse  frequency  law  which  is  predicted  from  the 
theoretical  models  (see  references  above) . 

Figure  4. 1.2. 4  reproduces  the  phase  RMS  versus  frequency 

plots  from  Fremouw  et  al .  (1978)  for  Poker  Flat  (geomagnetic 

dip  latitude  65?4  N)  and  Ancon  (geomanetic  dip  latitude  0?4 

N) .  We  believe  that  these  are  the  only  results  from  the  DNA 

Wideband  experiment  which  have  been  analyzed.  The  Poker  Flat 

results  are  most  applicable  to  the  mid-latitude  sites  which 

have  been  used  in  the  VLBI  experiments  we  will  be  studying. 

The  results  bound  the  scintillation  effects  at  25  psec  for 

o 

the  S-band  phase  delays  (corresponding  to  20  phase  RMS 
error) ,  and  2  psec  for  the  X-band  phase  delays  (corresponding 
to  5°  RMS  phase  error) . 

Observations  of  group  delay  scintillations  are  non¬ 
existent  although  Bogusch  et  al,  (1981)  address  this  problem 
from  a  theoretical  aspect.  We  can  place  upper  and  lower 
bounds  on  the  scintillation  errors  based  on  the  phase-delay 
results.  The  lower  bound  for  the  group-delay  scintillation 
will  be  the  same  as  the  phase  delay  scintillation,  assuming 
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Figure  4. 1.2.4  Phase  scintillation  as  a  function  of  frequency 
(reproduced  from  Fremouw  et  al. .  1978) 


Poker  Flat.  Alaska 

latitude  65°  7'  N 

longitude  147°  29*  W 

geomagnetic  dip 
latitude  65?4  N 


Ancon.  Peru 
latitude  11°  46' S 
longitude  77°  9'  W 


geomagnetic  dip 
latitude  0?4  N 


that  the  phase  scintillations  are  perfectly  correlated  across 
the  bandwidth  used  to  measure  the  group  delay.  The  upper 
bound  can  be  calculated  assuming  the  phase  errors  due  to 
scintillation  are  uncorrelated  across  the  observing  band¬ 
width.  At  S-band,  the  frequency  range  used  to  estimate  the 
group  delay  is  “75  MHz  and  20°  phase  errors  in  three  channels 
(at  2.295  GHz,  2.270  GHz  and  2.220  GHz)  spanning  this  band¬ 
width  would  lead  to  group-delay  errors  of  *1  nsec.  At 
X-band,  the  spanned  bandwidth  is  *300  MHz  and  5°  RMS  phase 
errors  in  4  channels  (at  8.34  GHz,  8.49  GHz,  8.54  GHz,  and 
8.24  GHz)  would  lead  to  group-delay  errors  of  *0.06  nsec. 
These  upper  bounds  are  most  likely  far  too  large. 

We  can  obtain  much  better  estimates  of  both  group-delay 
and  phase-delay  scintillation  effects  from  VLBI  data. 
Scintillations  are  due  to  small  scale  irregular ites  in  iono¬ 
spheric  plasma  density.  The  Fresnel-zone  diameter  at  a 
height  of  350  kms,  for  a  radio  source  an  infinite  distance 
from  the  observer  and  an  observing  frequency  of  2.3  GHz,  is 
*430  m.  (The  Fresnel-zone  diameter  at  a  distance  d^  from  the 
observer  with  the  source  at  infinity  is  2/KcT^  where  k  is  the 
wavelength  of  the  observed  radiation.)  Most  of  the  scintil¬ 
lation  will  be  caused  by  ionospheric  density  fluctuations 
which  have  spatial  wavelengths  of  less  than  the  Fresnel-zone 
diameter.  By  looking  at  VLBI  measurements  of  ionospheric 
delay  on  a  short  baseline  we  should  be  able  to  observe 
directly  the  scintillation,  or  at  least  place  bounds  on  its 
magnitude.  In  Figures  4.1.2.5a  and  b,  we  show  the  iono- 


Figure  4. 1.2. 5. b  Ionospheric  delay  correction  at  X-band  computed  from 
X-band  and  S-band  phase  delays. 
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Figure  4. 1.2. 5. c  Ionospheric  delay  correction  at  X-band  computed  from 
X-  and  S-band  phase  delays  (same  as  Figure  4. 1.2. 5. b 
except  plotted  on  an  enlarged  scale) 
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spheric  delay  at  X-band  for  the  Haystack-Vfestford  baseline 
(the  length  of  this  baseline  a1.24  km),  measured  from  both 
group  and  phase  delays.  Figure  (5.1.2.5c)  is  the  same  as 
Figure  (4.1.2.5b),  but  plotted  on  a  much  larger  scale.  The 
ionospheric  delay  measurements  on  this  baseline  should  only 
be  sensitive  to  small  scale  ionospheric  density  variations. 
The  RMS  scatter  of  the  group-delay  ionospheric  measurements 
is  15.3  psec.  The  expected  scatter  of  the  results,  assuming 
that  the  ionospheric  delay  is  zero  is  9.0  psec.  The  differ¬ 
ence  between  the  observed  scatter  and  the  expected  could  be 
accounted  for  by  ionospheric  scintillations  with  12.4  psec 
RMS  at  X-band.  This  RMS  places  a  bound  on  the  contribution  of 
scintillation  to  the  X-band  group  delay  measurements  of  the 
ionospheric  delay.  We  believe  that  a  large  portion  of  the 
12.4  psec  additional  scatter  is  due,  not  to  scintillation, 
but  to  instrumental  errors  which  are  not  accounted  for  in 
computing  the  group-delay  uncertainties  (see  discussions  in 
Chapter  3 ) . 

The  phase-delay  measurements  of  the  ionospheric  delay  on 
the  Haystack-Westf ord  baseline  show  a  scatter  which  is 
consistent  with  the  uncertainties  of  the  phase  delays.  We 
have  shown  these  results  in  two  figures,  the  first  plotted  at 
the  same  scale  as  the  group-delay  measurements  of  the  iono¬ 
spheric  delay  (Figure  4.1.1.5b)  and  the  other  at  a  much 
larger  scale  (Figure  4.1.2.5c). 

The  RMS  scatter  of  these  phase-delay  results  is  0.34 
psec.  We  see  no  evidence  of  scintillation  at  all  in  these 
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results  because  the  scatter  of  the  results  is  0.8  times  the 
expected  scatter  of  these  measurements,  given  the  uncertain¬ 
ties  of  the  phase  delays  used  in  the  calculations  which 
indicates  that  the  presence  of  additional  noise  sources  in 
unlikely. 

These  results  are  of  course  not  definitive  but  they  do 
indicate  that  scintillation  should  not  be  a  major  source  of 
error  (at  least  in  the  June,  1981  experiment) . 

In  Table  4. 1.2. 2  we  summarize  the  expected  error  in  the 
dual- frequency  ionospheric-delay  calibration  due  to  each  of 
the  contributions  discussed  in  this  section.  Even  the 
largest  term  which  would  be  systematic  in  nature  is  less  than 
1  mm,  i.e.  the  term  due  to  magnetic  field.  The  magnetic- 
field  error  source  could  be  reduced  to  a  very  small  amount  by 
the  adoption  of  tri-frequency  receivers,  i.e.  to  first  order 
the  magnetic  field  contribution  to  the  group  delays  is 
inversely  proportional  to  frequency  cubed.  Hence  by 
observing  in  three  frequency  bands  we  could  separate  this 
contribution  from  the  inverse  frequency  squared  term.  An 
alternative  approach  would  be  to  combine  the  group  and  the 
phase  delays  in  a  dual-frequency  system.  At  present  it  seems 
that  the  instrumental  errors  in  the  group  delays  are  too 
large  for  the  magnetic  field  term  to  be  successfully 
recovered  using  this  technique  (see  Section  4.2). 

Scintillation  could  potentially  cause  large  random 
errors  in  the  ionospheric  delay  calibration,  but  the  results 
from  the  Haystack-Westf ord  baseline  seem  to  bound  this  error 
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Table  4. 1.2. 2  Summary  of  the  magnitudes 

of  the 

errors  in  the 

dual  frequency  ionospheric 

correction 

Term 

Error  in 

X-band 

Error 

in  estimated 

ionospheric  delay 

non-di 

spersive  delay 

correction 

phase 

group 

phase 

group 

(psec) 

(psec) 

(psec) 

(psec) 

A.  Magnetic  field 

1.2 

2.4 

0.9 

1.8 

B.  Damping 

0.0002 

0.0007 

0.0002 

0.0007 

C.  Higher  order 

0.02 

0.04 

0.02 

0.04 

terms 

D.  Bending 

0.1 

0.4 

0.1 

0.04 

* 

E.  Scintillation 

RMS 

RMS 

RMS 

RMS 

<1.8 

<12.0* 

<2.7 

<18.0* 

The  values  in  this  table  are  appropriate  for  observations  at 
8.4  GHz  and  2.3  GHz. 

*  The  contribution  of  the  scintillation  term  is  expressed  as 
an  RMS  because  its  contribution  will  be  random. 

*  These  values  are  based  on  the  VLBI  measurements  of  the 
ionospheric  delay  on  a  short  baseline  (Haystack-Westf ord  *1.2 
km  length)  in  Junef  1981. 


Vi’.  V 


L  MV 
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The  technique  of  eliminating  the  phase  delay  ambiguities 
is  essentially  an  application  of  Equations  (4.1.1.16)  through 
(4.1.1.19).  The  procedure  we  have  adopted  is  to  firstly 
calculate  the  ionospheric  delay  correction  and  the  "non-dis- 
persive"  delay  from  the  group  delay  observations  at  X-  and 
S-band.  These  values  are  then  used  to  predict  the  X-  and 
S-band  phase  delays.  The  number  of  phase  delay  ambiguities 
are  then  calculated  such  that  the  phase  delays  (calculated 
from  the  visibility  phases)  will  be  within  half  an  ambiguity 
of  the  predicted  phase  delays.  We  then  reverse  the  process 
and  predict  the  expected  values  of  the  group  delays  based  on 
the  phase  delays.  This  final  stage  is  useful  in  analyzing 
errors  in  the  group  delays. 

Prediction  of  phase  delay  ambiguities  based  on  group 
delays  is  a  new  technique.  In  order  to  check  these  procedures 
we  will  first  analyze  experiments  which  allow  the  phase  delay 
ambiguities  to  be  removed  independently  of  the  group  delays. 
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These  experiments,  carried  out  in  July  1980  and  June  1981, 
consisted  of  observing  two  sources,  close  in  the  sky,  which 
could  be  repeatably  observed  in  a  cyclical  manner  for  many 
hours  (six  hours  in  July  1980  and  twelve  hours  in  June  1981 
for  one  pair,  0?5  apart  in  the  sky,  and  three  hours  for  a 
second  pair  10?0  apart) .  During  the  short  interval  between 
successive  observations  on  each  source,  we  can  model  the 
change  in  phase  to  much  better  than  half  an  ambiguity,  in  most 
cases,  thus  allowing  each  phase  delay  observation  to  be 
"connected"  to  the  ones  before  and  after  it. 

Before  discussing  the  results  of  these  experiments  we 
will  first  study  the  problem  of  phase  delay  ambiguity  predic¬ 
tion.  In  particular  we  will  need  to  know  the  effects  of 
random  errors  in  the  group  delays  on  the  phase  delay  predic¬ 
tions.  We  will  also  need  to  know  the  effects  of  phase  delay 
ambiguities  on  the  prediction  of  the  group  delays. 

The  phase-delay-ambiguity  prediction  algorithms  are 
derived  from  Equations  (4.1.1.16)  through  (4.1.1.19).  We 
reproduce  these  equations  here,  with  an  additional  term  added 
to  each  equation  to  account  for  observation  noise  (either 
random  or  systematic) .  The  relationships  between  our  various 
observations  are: 


(4.2.1) 

(4.2.2) 


(4.2.3) 


t_  =  x_  -  yl  +  C>  +  e_ 
p  n  x  p  p 


(4.2.4) 


where  e*,  e®,  e*,  and  e®  are  additive  noise  contributions  to 
g  g  p  p 

the  X-  and  S-band  group  delays,  and  the  X-  and  S-band  phase 
delays,  respectively.  The  phase  delays  given  in  Equations 
(4.2.3)  and  (4.2.4)  are  not  normally  known.  The  quantities 
which  we  do  know  are  the  visibility  phases,  A  and  A  (cor- 

X  s 

rected  for  feed  rotation;  see  Section  4.1.2),  which  are 
related  to  the  phase  delays  by  (see  Appendix  A) 


’p  =  <2*NX  +  *x)A,x 


(4.2.5) 


Tp  =  (2uNs  +  As)/^s 


(4.2.6) 


where  u>  and  w  are  the  (angular)  RF's  in  each  band.  The 
X  s 

(integral)  numbers  N  and  N  are  the  phase  delay  ambiguities. 
These  are  the  numbers  which  we  wish  to  predict  based  on  the 
group  delay  measurements.  Normally,  we  make  crude  estimates 
of  N  and  N  ,  and  then  calculate  corrections,  AN  and  AN  ,  to 

X  S  X  5 

these  estimates  in  order  to  find  the  values  which  satisfy 
Equations  (4 . 2. 1) - (4 . 2. 4) .  The  phase  delays  calculated  from 
the  crude  estimates  of  the  ambiguities  and  the  visibility 
phases  will  be  called  the  a  priori  phase  delays. 

An  obvious  approach  to  estimating  N  and  N  is  to  manipu- 
late  Equations  (4.2.1)  and  (4.2.2),  and  form  estimates  of  the 
phase  delays  based  on  Equations  (4.2.3)  and  (4.2.4).  A  simple 
rearrangement  of  Equations  (4.2.1)  and  (4.2.2)  yields. 


-xf)  /  (1-3)  +  (0f  +  ef-e*)/(l-p) 


(4.2.7) 
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and 

tn  =  (p-tg  -tJ)  /  0-1)  +  (0®+eg-peg)/(P-D  •  (4.2.8) 

Since  t*  and  are  the  only  measured  quantities  in  Equations 
(4.2.7)  and  (4.2.8),  we  can  form  estimates  of  Ix  and  ^  using 
the  first  terms  on  the  right  hand  sides  of  Equations  (4.2.7) 
and  (4.2.8).  These  estimates  will  be  given  by 

(Vgg  =  <*g  -xJj/d-P)  (4.2.9) 

and 

(tn>gg  “  {Pxg  (4.2.10) 


and  they  will  be  related  to  Ix  and  Tn  by 


(0®  +e®  -e*)/(l-p) 
g  g  g 

(°g  +*g  -f>e*)/(f!-l> 


(4.2.11) 

(4.2.12) 


where  the  superscripts  xs  and  corresponding  subscripts  gg 
denote  the  observation  types  used  to  make  the  estimate.  (In 
practice,  an  estimate  of  Og  is  made  before  Equations  (4.2.9) 
and  (4.2.10)  are  evaluated.  The  0®  given  in  Equations 
(4.2.11)  and  (4.2.12)  can  then  be  treated  as  a  correction  to 
the  a  priori  value.) 

We  may  now  make  predictions  of  the  phase  delays  based  on 
Equations  (4.2.11)  and  (4.2.12),  and  Equations  (4.2.3)  and 
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(4.2.4).  The  predictions  are 


.-x.  xs  ,a  .xs  .  xs 

( x  )  =  ( x  )  -  a  I 

p  gg  n  gg  x  gg 


(4.2.13) 


,  s ,  xs  .  ,xs  ,xs 

TP  gg  ~  Tn  gg  ~  Y  *x^gg 


(4.2.14) 


Some  simple  algebra  shows  that  the  difference  between  the 
a  priori  phase  delays  and  the  predictions  will  be 

V<*p>gg  *  -2”4V“x+  °p+  'p+  flK.noJ.e'l-MI.’l/lf-l). 


(4.2.15) 


and 

S  . « S .  X  S  a.  _Sj.  r  / 1  /nS,_S.  ,__j_o\_X 


V^p'gg  =  -2«4«s/"s+  °p+  Ep*  t  (!+y)  (0g+4g)-  (,+P)  Eg)/  0-1) 


(4.2.16) 


where  **  and  are  the  a  priori  estimates  of  x*  and  x®  based 

on  assumed  values  of,  N°  and  N°,  N„  and  N0  (i.e.,  = 

x  s  x  s  p 

(2itN°+<J>  )/<i>  );  and  the  actual  number  of  ambiguities  will  be 

XX  X 

given  by  NX=N°+ANX*  Similar  relationships  hold  for  the  S  band 
ambiguities. 

Equations  (4.2.15)  and  (4.2.16)  provide  us  with  a  method 
of  calculating  an  and  AN  ,  but  we  should  notice  that  these 
estimates  will  be  corrupted  by  the  offsets  0®,0*  and  0®,  and 

?  r  r 

X 

the  noise  contributions  to  each  of  the  delay  measurements  e^, 
£g'  Ep'  a"d  ep’ 

It  will  be  instructive  to  consider  each  of  these  corrup¬ 
tions  (the  offsets  and  the  noise),  separately.  The  offsets, 
if  constant,  are  more  of  an  annoyance  than  a  problem.  (In  our 
formulation  any  variations  in  the  offsets,  by  definition,  are 


absorbed  into  the  noise  contributions.)  If  we  rewrite 
Equations  (4.2.15)  and  (4.2.16)  excluding  the  noise  terms,  we 
have 

V(*p}gg  *  -2**W  °p+  (4. 2. 17. a) 

and 

Tp-(tp)gg  -  -2nANgA>s+  0®+  (l+y)0®/(P-l)  .  (4.2.17.b) 

We  have  two  equations  above,  and  5  unknowns,  although  two  of 
these  unknowns  (AN  and  AN  )  must  be  integral.  Therefore, 
from  the  first  pair  of  prediction  differences,  we  can  set  the 
non-integral  parts  of  the  prediction  differences  equal  to  the 
offsets.  Subsequent  prediction  differences  could  then  be  used 
to  check  the  constancy  of  the  offsets.  We  should  note  that 
there  is  no  unique  separation  of  the  ambiguity  corrections  and 
the  offsets,  i.e.  AN  and  AN  can  be  changed  by  any  integral 
values  (but  the  same  ones  for  all  observations)  and  these 
changes  can  be  absorbed  into  the  offsets.  These  arbitrary 
shifts  of  the  phase  delays,  by  constant  amounts,  pose  no 
serious  problem  because  they  are  indistinguishable  from  an 
epoch  offset  between  the  hydrogen-maser  frequency  standards  at 
each  site. 

We  now  turn  our  attention  to  the  noise  contribution  to 
the  prediction  differences.  These  contributions  do  pose  a 
serious  problem  for  ambiguity  prediction,  if  the  noise  is 
large.  Equations  (4.2.15)  and  (4.2.16),  omitting  the  "0" 


terms,  are 


X  ,.xvxs 

V  Vgg 


and 


s  ..s.xs 

T  - { T_ ) __ 

p  p  gg 


-2nAN  / us  +  e*+  [  (1  +  a)  e®-  (a+0)  e*]  /  0-1) 
x  x  r  9  9 

(4. 2. 18. a) 

-2itANg/(os+  Ep+  [  (1  +  y)  €g- (y  +  p)  e*] /(p-1) 

(4 . 2. 18 .b) 


In  the  following  discussion  we  will  consider  the  effects  of 
several  different  forms  of  noise  on  the  prediction  process. 
Before  doing  so  we  will  compress  the  notation  of  Equations 
(4.2.18).  We  will  write  the  equations  as 

*Pp  -  -2"*V"x+  *p  +  <Cx>M  -  <Cx>g*g  (4. 2. 19. a) 

and 

tfp  -  -2XAVV-  4  ♦  <Cs>geg  -  (Cs>gEg  (4.2.19.b) 

where  AP*  and  AP®  are  the  prediction  differences  at  X-  and 
S-band,  respectively,  and  <Cx)g,  (Cx)*,  (Cg)®,  and  (Cg)*  are 
the  coefficients  which  multiply  the  group  delay  errors  in 
Equations  (4.2.18).  The  values  of  these  coefficients  for 
a)  /(2t[)»8.3  GHz  and  w  /  (2  n)  =2 . 3  GHz  are  (C  )^=0.16, 

X  s  X  y 

(Cx)g=1.16,  (Cg)  gjsl .  16 ,  and  (Cg)*=2.16,  for  a=1.0,  and 
psy=13.5.  (<x»p  and  y  were  the  ratio  of  the  respective 
effective  frequencies  squared;  see  Equations  (4.1.1.12)  to 
(4.1.1.16) .) 

Clearly  from  Equations  (4.2.19)  we  can  see  a  coupling  of 
the  effects  on  the  prediction  errors,  of  the  errors  in  the 
group  delays  at  X-  and  S-bands,  i.e.,  an  error  in  the  S-band 


group  delay  will  affect  the  prediction  of  both  the  S-band  and 
the  X-band  phase-delay  ambiguities.  We  may  statistically 
calculate  this  coupling  by  computing  the  variance-covariance 
matrix  (or  simply  covariance  matrix)  of  the  prediction  errors. 
If  the  group-delay  errors  are  uncorrelated ,  and  if  we  neglect 
the  error  contribution  of  the  phase  delays  (these  are 
approximately  40-fold  smaller  than  the  group  delay  errors) , 
the  covariance  matrix  of  the  prediction  errors  will  be  given 
by 

VAP  =  T  <(APp)2>  <(AP*)(AP®)> 

_<(AP*)  (AP®)>  <(AP®)2> 

1.35 (c*)2+0. 026  («j®)2  -2.51(o*)2+0.019(a®)  2 

y  y  y  y 

symmetric  4.67 (a*) 2+1.35 (a®) 2 

(4.2.20) 

X  2  s  2 

where  (a*)  and  (®g)  are  the  variances  of  the  X-  and  S-band 
group  delays ,  respectively,  and  where  we  have  substituted  the 
numerical  values  of  the  coefficients  in  the  covariance  matrix. 

In  order  to  resolve  correctly  the  phase  delay  ambigu¬ 
ities,  we  need  the  prediction  errors  to  be  much  less  than  one 
cycle  at  each  frequency.  In  Figure  4.2.1,  we  plot  the  bounds 
on  the  group  delay  uncertainties  such  that  the  phase-delay 
prediction  differences  will  have  uncertainties  less  than  0.030 
nsec  (*0.25  cycles)  and  0.110  nsec  (*0.25  cycles)  at  X-  and 
S-band,  respectively.  (These  uncertainties  represent  1/4  of  a 
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cycle  at  each  frequency.)  From  the  figure  we  see  that  in 
order  to  simultaneously  predict  both  the  X-  and  S-band 
ambiguities,  we  need  to  know  the  group  delays  with  uncertain¬ 
ties  of  <0.025  nsec  and  <0.090  nsec,  respectively,  at  these 
two  frequencies.  These  bounds  put  very  stringent  accuracy 
requirements  on  the  Mark  III  system  if  we  want  to  predict 
phase-delay  ambiguities  reliably.  In  Section  4.3,  we  will 
investigate  whether  or  not  the  Mark  III  system  can  meet  these 
accuracy  requirements. 

After  the  estimates  of  the  phase  delay  ambiguities  have 
been  obtained,  the  resultant  phase  delays  are  used  to  obtain 
estimates  of  the  group  delays.  The  differences  between  the 
observed  values  of  the  group  delays  and  their  predicted  values 
can  be  used  to  study  the  errors  in  the  group  delays  and  to 
detect  any  remaining  ambiguity  errors.  We  will  refer  to  these 
differences  as  "apparent  group  delay  errors." 

The  relationship  between  the  observed  group  delays  and 
the  predictions  based  on  the  phase  delays  may  be  derived  in  a 
manner  similar  to  that  used  to  derive  Equations  (4.2.15)  and 
(4.2.16).  The  predicted  group  delays  will  be  given  by 

(Tg)pp  -  ((Y+lHp  -  (1+a)  Tp)  /  (y-a)  (4.2.21) 

and 

(*g)pp  *  ((Y+P)t£  -  (P+a) t*) / (Y-®)  (4.2.22) 
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and  the  errors  in  the  predictions  will  be 


Tg"(?g)pp  *  eg  '  <Y  +D  (o£+2icANx/u>x+e£)  /  (y-a ) 
+  (l+o  )  (Op  +  2*ANs/a)g+ep)  /  (y-a) 


and 


Tg~(^g)pp  =  eg  "  <Y+P)  (Op+2nANx/a>x+e*)/(Y-a) 

+  (a +p  )  (0“*+2nAN  A)  +e  **)  /  (Y-a ) 
p  s  s  p 


(4. 2. 23. a) 


(4.2. 23. b) 


As  we  did  earlier  in  this  section,  we  will  consider  parts  of 
the  above  equations  separately.  We  consider  firstly  the 
effects  of  the  phase-delay  ambiguities.  In  this  case, 

Tg  “(?g}pp  =  "  0-139ANx  +  0.070ANg  (nsec)  (4. 2. 24. a) 

and 

-(Tg)pp  =  -  0.259ANX  +  0.510ANS  (nsec)  (4.2.24.b) 

where  we  have  substituted  the  approximate  numerical  values  for 
the  constants  given  in  Equations  (4.2.23).  The  above  equa¬ 
tions  highlight  one  of  the  disadvantages  of  the  choice  of  X- 
and  S-band  for  the  Mark  III  system  when  phase-delay  ambiguity 

prediction  is  attempted.  If  AN  *2  and  AN  *1,  then  the  differ- 

X  s 

ence  between  the  observed  X-band  group  delay  and  the  value 
predicted  from  the  phase  delays  would  differ  by  only  0.001 
nsec.  Of  course,  in  this  case  the  difference  between  the 
observed  S-band  group  delay  and  its  prediction  would  be  0.761 


nsec 


However,  there  are  cases  where  the  noise  contribution 


to  the  S-band  group  delays  could  be  this  large  (see  Section 
4.3).  In  this  case,  these  ambiguity  errors  could  not  easily 
be  detected. 

In  general,  we  see  from  Equations  (4.2.24)  that  ambigu¬ 
ities  in  the  phase  delays  produce  differences  between  the 
observed  group  delays  and  the  predictions  of  multiples  of 
<*0.070  nsec  and  *0.26  nsec  at  X-  and  S-band,  respectively. 
The  process  of  phase-delay-ambiguity  prediction  is  somewhat 
harder  than  may  have  been  initially  thought,  given  the 
ambiguity  spacings  of  “0.12  nsec  and  “0.44  nsec  at  X-  and 
S-band,  respectively,  because  of  the  coupling  of  the  X-  and 
S-band  errors.  We  should  also  notice  that  changing  the 
ambiguities  at  one  frequency  will  affect  the  implied  group 
delay  errors  at  both  X-  and  S-band. 

The  noise  contributions  to  the  apparent  group-delay 
errors  may  also  be  calculated  from  Equations  (4.2.23).  In 
this  case. 


x  ,.x.xs  x  .  ,,_x  ,  n  .,_s 
t  -  ( i  ) __  =  e  -  1.16e  +  0 . 16 e 

9  g  PP  9  P  P 


and 


S  .  a  S .  X  S  S  a  ^  £  X  ,  i  i  r  .  s 

-  (t  )„  =  -  2.16e_  +  1.16e 

g  g  pp  g  p  p 


(4. 2. 25. a) 

(4. 2.25.b) 


where  again  we  have  substituted  the  numerical  values  of  the 

coefficients.  If  we  firstly  consider  random  error  only,  then 

e*  <<  and  e®  <<  e®,  and  the  major  contribution  to  the 
P  9  P  g  J 

differences  given  above  will  be  from  the  group  delays.  If  the 
errors  in  the  group  delays  are  too  large  then  the  phase  delay 
ambiguities  will  be  resolved  incorrectly  (if  no  other  means 


are  available  for  determining  the  phase  delay  ambiguities) , 
and  the  differences  between  the  observed  group  delays  and 
their  estimates  based  on  phase  delays  will  be  smaller  than  the 
actual  errors. 

Figure  4.2.2  allows  us  to  study  the  "folding"  of  group 

delay  errors  in  more  detail  using  a  graphical  technique.  In 

this  figure  we  have  shown,  as  a  function  of  e*  and  eS,  the 

g  g 

boundary  of  those  errors  that  will  lead  to  the  correct 
ambiguity  predictions.  (This  boundary  is  the  central  diamond 
shaped  region.)  If  the  errors  in  the  group  delays  fall  out¬ 
side  of  this  region  (see,  for  example,  point  labeled  A),  then 
the  phase  delay  ambiguities  will  be  incorrectly  estimated. 

When  the  errors  in  the  group  delays  are  computed  from  predict¬ 
ed  phase  delay  ambiguities,  the  apparent  errors  will  fall 
within  the  central  region  (see  point  A').  Consequently, 
independent  of  the  original  errors  in  the  group  delays,  the 
apparent  errors  will  always  be  inside  the  small  central 
region.  Figure  4.2.2  will  be  useful  when  we  are  analyzing 
data  because  it  will  give  us  a  convenient  means  for  determin¬ 
ing  the  relationship  between  the  actual  errors  in  the  group 
delays  and  the  apparent  errors  calculated  from  the  phase 
delays  (if  the  phase-delay  ambiguities  are  determined  solely 
from  the  group  delays) . 

So  far  we  have  been  discussing  the  X-  and  S-band  phase 
delays  separately.  However,  if  we  want  to  benefit  from  the 
precision  of  the  phase  delays,  we  should  compute  the  non-dis- 
persive  delay  from  the  phase  delays.  Again,  we  will  be  inter- 


Figure  4.2.2  Transformation  of  actual  group  delay  errors  to  apparent  group 
delay  errors 
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ested  in  the  effects  of  ambiguities  in  the  phase  delays  on  the 
estimate  of  the  non-dispers ive  delay.  Firstly,  we  will 
examine  the  expressions  for  the  non-dispersive  delay  estimates 
computed  from  the  group  delays.  From  Equations  (4.2.12),  we 
have  for  the  group  delays 

<Vgg  *  Tn  +  •  (4.2.26) 

The  expression  for  the  non-dispersive  delay  estimated  from  the 
phase  delays  may  be  derived  in  a  manner  similar  to  that  used 
to  derive  Equation  (4.2.12).  The  result  is 

(Vpp  =  Tn  +  1*08  ( 2  n aN x / (jjx  +  e* )  -  0.08(2nANs/Ws+^). 

(4.2.27) 

We  should  notice  in  Equation  (4.2.27)  that  (Tn)*p  has  two 

ambiguity  spacings,  one  of  0.13  nsec  due  to  the  X-band  phase 

delay  ambiguity,  and  the  other  of  0.035  nsec  due  to  the  S-band 

phase  delay  ambiguity.  This  latter  ambiguity  spacing  clearly 

indicates  the  need  for  a  reliable  prediction  of  the  S-band 

phase  delay  ambiguities.  The  effect  of  an  S-band  ambiguity 

(0.035  nsec)  would  be  difficult  to  detect  in  the  postfit 

residuals  from  a  geodetic  solution  using  (t_)__  because  the 

J  n  pp 

errors  in  the  geodetic  models  for  the  delays  have  RMS  scatters 
of  *0.090  nsec  (see  Chapter  5). 

In  anticipation  of  the  results  which  will  be  presented  in 
Section  4.3,  in  the  remainder  of  this  section  we  discuss  the 
effects  of  a  number  of  error  types  on  the  differences  between 
the  various  delay  measurements  and  their  predicted  values. 
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We  now  investigate  the  detection  of  errors  in  both  the 
group  and  phase  delays,  by  comparing  the  differences  between 
Equations  (4. 2. 25. a)  and  (4.2.25.b)  and  Equations  (4.2.26)  and 
(4.2.27).  In  view  of  the  results  which  will  be  presented  in 
the  next  section,  we  will  discuss  two  classes  of  systematic 
errors  in  the  observations.  Firstly,  we  consider  the  case 
when  the  S-band  group  delay  measurements  have  a  systematic, 
time  dependent,  error,  and  all  of  the  other  measurement  errors 
are  assumed  to  be  zero.  We  see  from  Equation  (4. 2. 25. a)  that 
the  differences  between  observed  X-band  group  delays  and  their 
predicted  values  from  the  phase  delays  will  be  unaffected  by 
such  an  error  (i.e.,  eo=£d=eds°) •  The  S-band  group  delay  dif- 

a  r  r 

ferences  will  be  non-zero,  and  will  equal  the  errors  in  the 
S-band  group  delays.  The  non-dispersive  delay  estimates  will 
differ  by  -0.08  times  the  errors  in  the  S-band  group  delays. 

We  can  easily  detect  this  error  because  the  X-band  group  delay 
predictions  will  agree  with  the  observed  values.  Similarly, 
if  the  X-band  group  delays  are  in  error  (but  not  the  S-band 
group  delays) ,  then  the  X-band  group  delay  predicted  and 
observed  values  would  differ.  In  this  case,  the  S-band  group 
delay  predictions  would  agree  with  the  predicted  values,  hence 
allowing  the  X-band  group  delays  to  be  isolated  as  the  delay 
measure  with  an  error. 

Thus  far,  we  have  considered  the  phase  delays  to  be  error 
free  because  of  the  »40-fold  smaller  uncertainties,  and  their 
insensitivity  to  instrumental  errors  (see  Section  3.1). 
However,  there  are  certain  errors  which  will  affect  the  group 
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and  phase  delays  equally.  Such  an  error  can  arise  in  several 
different  circumstances.  For  example,  all  dual-band  observa¬ 
tions  made  at  Onsala,  Sweden,  use  two  radio  telescopes,  one 
for  receiving  X-band  signals  and  the  other,  *600  m  away  from 
the  X-band  antenna,  for  receiving  S-band  signals.  In  this 
case,  the  S-band  observations  are  corrected  to  the  X-band 
radio  telescope's  position,  using  the  coordinate  differences 
between  the  two  radio  telescopes.  Any  errors  in  the  relative 
positions  of  the  S-  and  X-band  antennas  will  introduce  an 
error  in  the  S-band  delays  referred  to  the  X-band  telescope's 
position.  In  addition,  any  differences  in  the  atmospheric 
delay  between  the  two  antennas  will  introduce  a  similar  type 
error.  If  we  consider  Equations  (4.2.25)  we  see  that  if  the 
S-band  group  and  phase  delays  have  equal  errors  then  the 
differences  between  the  observed  and  predicted  values  of  the 
group  delays  at  X-  and  S-band  will  be  non-zero  and  equal. 
However,  if  we  examine  the  expressions  for  the  non-dispersive 
delays  (Equations  (4.2.26)  and  (4.2.27))  we  see  that  this 
error  will  cause  no  difference  between  the  estimates  of  the  . 
non-dispersive  delay  from  the  group  and  phase  delays.  Again, 
based  on  these  properties  we  should  be  able  to  detect  this 
type  of  error. 

Of  course,  with  actual  measurements,  all  of  the  delay 
measurements  will  have  some  amount  of  error,  and  this  will 
complicate  the  isolation  of  errors  when  we  compare  predictions 
and  observed  values  of  the  various  delay  measures.  We  will 
investigate  the  magnitude  of  these  errors  by  testing  our 
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ambiguity  elimination  techniques  on  data  which  can  be  phase 
connected  without  the  use  of  group  delays. 


4.3  Results  from  "phase-connection”  experiments 

When  the  studies  of  predicting  phase  delay  ambiguities 
were  started,  it  was  hoped  that  a  technique  could  be  developed 
which  would  allow  phase  delays  to  be  used  in  geodetic  experi¬ 
ments.  The  initial  studies  soon  indicated  that,  although  the 
Mark  III  system  could  theoretically  measure  sufficiently 
accurate  group  delays  to  allow  reliable  ambiguity  prediction, 
the  actual  measurements  were  of  insufficient  accuracy.  Conse¬ 
quently,  the  studies  on  phase-delay-ambiguity  prediction,  have 
been  used  instead  to  study  the  quality  of  the  group  delays. 

In  this  section,  we  will  therefore  concentrate  on  errors  in 
the  group  delays  which  have  been  estimated  by  computing  esti¬ 
mates  of  the  group  delays  from  the  phase  delays. 

The  techniques  used  to  carry  out  the  ambiguity  predic¬ 
tions  were  explained  in  Section  4.2.  We  demonstrate  the 
application  of  these  techniques  in  Figure  4.3.1.  This  figure 
shows  the  differences  between  the  predicted  and  the  phase- 
connected  phase  delays  at  both  X-  and  S-band  for  the 
Ef felsberg-Onsala  interferometer.  One  aspect  of  this  figure 
is  immediately  apparent.  The  differences  between  the  predict¬ 
ed  and  the  phase-connected  phase  delays  at  X-  and  S-band  are 
not  randomly  distributed.  They  show  systematic  trends  at  both 
X-  and  S-band.  We  should  also  notice  that  the  drift  between 


Figure  4.3.1  Differences  between  the  predicted  and  the  phase  connected  phase  delays  for 
observations  of  the  radio  source  3C  345  on  the  Ef felsberg-Onsala  baseline 
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the  predictions  and  the  phase-connected  phase  delays  is  suffi¬ 
ciently  large  that,  without  the  frequent  observations  of  this 
source  which  were  available  in  these  experiments,  the  phase 
delay  ambiguities  would  have  been  incorrectly  eliminated.  If 
we  now  compare  the  observed  group  delays  with  their  predicted 
values  (from  the  phase  delays)  we  see  the  reason  for  the 
systematic  differences  of  the  phase  delay  predictions  (Figure 
4.3.2).  These  differences  will  be  called  the  "apparent  group 
delay  errors."  (We  use  the  word  "apparent"  because  these 
differences  could  be  due  to  remaining  phase  delay  ambiguities 
or  errors  in  the  phase  delays,  in  addition  to  group  delay 
errors.  However,  any  ambiguity  errors  will  cause  discontinu¬ 
ities  in  the  systematic  trends.) 

Most  of  the  variations  in  the  X-band  group  delays  can  be 
explained  by  the  effects  of  the  spurious  signal  in  the  Onsala 
X-band  receiver  (see  Section  3.2).  We  show  in  Figure  4.3.3 
the  X-band  apparent  group  delay  errors  after  correcting  for 
the  presense  of  the  spurious  signal.  This  correction  reduces 
the  apparent  errors,  but  there  are  still  some  small  (*0.03  * 

nsec)  differences.  The  apparent  S-band  group  delay  errors  can 
not  be  explained  by  the  effects  of  a  spurious  signal. 

In  an  attempt  to  determine  the  origin  of  these  differ¬ 
ences,  we  show  in  Figure  4.3.4,  the  phase  calibration  contri¬ 
butions  to  the  X-  and  S-band  group  delays  at  Effelsberg  and 
Onsala.  Clearly,  we  cannot  account  for  the  S-band  differences 


with  phase  calibration  errors.  Nor  can  the  difference  be  due 
to  an  error  in  the  relative  positions  of  the  two  radio  tele- 


3.2  Differences  between  the  observed  and  predicted  group  delays  for  the  data 


Figure  4.3.3  Differences  between  the  X  band  observed  and  predicted 
group  delays  for  the  Effelsberg-Onsala  baseline  after 
correcting  for  the  phase  calibration  error 
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scopes  at  Onsala  (see  discussions  in  Section  4.2) ,  because  if 
the  differences  were  due  solely  to  an  error  in  the  relative 
positions  of  the  telescopes,  then  the  apparent  X-band  group 
delay  errors  would  equal  the  apparent  S-band  group  delay 
errors.  At  this  time  the  origin  of  the  S-band  differences  is 
unknown.  We  will  discuss  these  differences  further  in  Section 
4.4. 

The  Ef f elsberg-Onsala  interferometer  is  not  the  only 
one  to  show  systematic  differences  between  the  group  delays 
and  their  predicted  values  from  the  phase  delays.  In  Figure 
4.3.5,  we  show  the  apparent  group  delay  errors  at  X-  and  S- 
band  for  the  several  other  interferometesr  involved  in  the 
July  1980  experiment.  (Note  that  the  differences  are  plotted 
on  different  scales  for  X-  and  S-band,  and  within  each  band  we 
use  two  different  scales  depending  on  the  size  of  the  differ¬ 
ences.) 

For  all  baselines  the  weighted-root-mean-sqaure  (WRMS) 
scatter  of  the  apparent  group  delay  errors  are  greater  than 
those  expected  from  the  signal- to-noise  ratio  of  the  observa¬ 
tions.  Of  course,  there  will  be  apparent  errors  in  group 
delays  if  there  are  phase-connection  errors.  However,  we  must 
remember  that  changing  the  phase  delay  ambiguities  at  one 
frequency  will  change  the  apparent  errors  in  the  group  delays 
at  both  X-  and  S-band  (see  Section  4.2).  Consequently,  it 
becomes  necessary  to  check  the  two  group-delay  differences 
when  considering  a  phase-delay-ambiguity  error  as  a  possible 
cause  for  an  apparent  group  delay  error.  In  addition,  the 


A. 3. 5  Apparent  group  delay  errors  in  two  bands  for  several  Interferometers  for  the 
July  1980  experiment 
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Figure  4.3.5  Continued 
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phase  delays  used  to  predict  the  group  delays,  satisfy  a 
closure  condition,  i.e.,  they  sum  to  approximately  zero  (and 
much  less  than  an  ambiguity)  around  closed  triplets  of  base¬ 
lines  when  the  phase  delays  are  referred  to  the  same  epoch 
(see  Section  3.3).  Hence,  if  the  phase-delay  ambiguities  are 
changed  on  one  baseline,  they  must  be  changed  on  at  least  one 
other  baseline  to  ensure  that  the  closure  conditions  remain 
satisfied. 

We  now  consider  the  differences  for  several  of  the  base¬ 
lines  shown  in  Figure  4.3.5.  We  firstly  consider  the  apparent 
S-band  group  delay  errors  for  the  Haystack-Onsala  interfer¬ 
ometer.  We  see  in  this  case  that  for  the  last  3  hours  of  the 
experiment  the  group  delays  seem  to  be  in  error  by  0.510  nsec 
which  suggests  an  S-band  phase  connection  error.  However, 
when  we  plot  the  group  and  phase  delay  closure  errors  for  a 
triplet  of  baselines  involving  Haystack  and  Onsala  (Figure 
4.3.6),  we  see  immediately  that  the  group  delays  have  closure 
errors  which  are  remarkably  similar  to  the  Haystack-Onsala 
apparent  group  delay  errors.  The  phase  delays,  however,  show 
very  small  closure  errors.  Hence,  even  before  we  attempted  to 
predict  group  delays  from  phase  delays,  we  should  have 
expected  large  differences  based  on  the  group-delay  closure 
errors . 

One  other  case  from  Figure  4.3.5  which  we  will  examine  is 
the  apparent  group-delay  errors  for  the  Ft.  Davis-Owens  Valley 
interferometer.  We  notice  that  in  this  case  the  apparent 
group  delay  errors  seem  to  have  a  bimodal  distribution. 


especially  at  S-band.  This  type  of  pattern  would  be  expected 
if  there  were  remaining  phase  delay  ambiguities.  However,  the 
phase  delay  postfit  residuals  at  X-  and  S-band  for  this  base¬ 
line  do  not  show  any  apparent  phase  connection  errors  (Figure 
4.3.7).  (In  this  case,  the  group  delay  closure  errors  have 
too  much  scatter  to  allow  the  detection  of  any  bimodal 
errors . ) 

We  have  compiled  the  statistics  of  the  apparent  group 
delay  errors  (in  a  manner  similar  to  that  used  in  Chapter  3), 
and  these  results  are  shown  in  Figure  4.3.8.  We  have  plotted 
as  a  function  of  the  expected  scatter  of  the  group  delay 
errors,  the  normalized  RMS  scatter.  We  have  shown  these  plots 
for  both  the  July  1980  and  June  1981  experiments.  The  normal¬ 
ized  RMS  plot  shows  trends  which  are  similar  to  those  for  the 
residual  phases  (see  Figures  3.1.12  and  3.1.13),  i.e.,  there 
appears  to  be  an  overall  scaling  factor  of  approximately  1.1 
at  X-band  and  1.2  at  S-band,  and  performance  limits  of  40 
psec  and  14  psec  at  X-band  for  the  July  1980  and  June  1981 
experiments,  respectively,  and  150  psec  at  S-band  for  both 
July  1980  and  June  1981. 

There  appears  to  be  a  large  improvement  in  the  X-band 
system  performance  between  these  two  experiments  which  is 
probably  due  to  the  smaller  spurious  signals  during  the  June 
experiment.  (Remember  that  at  Haystack  even  after  correcting 
for  the  presence  of  the  spurious  signal,  there  were  residual 
trends  in  the  contribution  of  the  calibration  phases  to  the 
group  delays.  See  Section  3.2  for  discussion.)  The  S-band 


Figure  4.3.7  Ft.  Davis-Owegs  Valley  Interferometer  phase  delay  postflt  residual  at 
X-  and  S-band 
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Figure  4.3.8  Continued 


system  performance  does  not  seem  to  have  improved  between  the 
two  experiments. 


The  performance  threshold  at  S-band  implies  that  we 
cannot  predict  phase  delay  ambiguities  from  group  delays  even 
with  infinite  signal- to-noise  ratio  observations  at  S-band. 
(See  Section  4.2;  the  upper  bound  in  the  S-band  uncertainty 
for  phase  delay  predictions  was  0.09  nsec.) 

We  have  yet  to  discuss  the  possible  origins  of  the  syste¬ 
matic  trends.  We  leave  this  discussion  until  we  have  consid¬ 
ered  the  apparent  group  delays  from  a  geodetic  experiment. 

4.4  Results  for  geodetic  experiments;  Ef felsberg-Onsala 

We  saw  in  Section  4.3,  that  when  we  evaluated  the 
apparent  group-delay  errors  from  the  phase-connected  experi¬ 
ments,  the  performance  of  the  S-band  system  was  not  good 
enough  to  allow  the  prediction  of  the  phase-delay  ambiguities 
from  the  group  delays.  However,  in  view  of  the  systematic 
trends  seen  in  the  apparent  group-delay  errors,  we  have 
attempted  to  predict  phase-delay  ambiguities  from  the  group 
delays  for  the  Ef felsberg-Onsala  interferometer,  in  order  to 
gain  a  better  understanding  of  these  trends. 

We  chose  to  analyze  the  July  26th  1980,  geodetic  experi¬ 
ment  because  this  experiment  had  the  largest  number  of 
observations  of  any  24hr  experiment  involving  these  two 
antennas.  We  show  in  Figure  4.4.1  the  apparent  group  delay 
errors  at  X-  and  S-band.  (We  have  plotted  each  figure  twice. 


Figure  4*4.1  Continued 
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once  with  one-standard-deviation  error  bars,  the  second  time 
without  error  bars.)  WO  see  in  these  figures  the  same  types 
of  systematic  trends  which  were  observed  in  the  phase 
connected  experiments.  Again,  the  S-band  results  show  much 
greater  trends  than  the  X-band  results.  The  normalized  RMS 
scatter  of  the  X-band  results  is  3.4,  which  indicates  that 
their  scatter  is  not  in  accord  with  their  statistics. 

we  have  shown  these  results  because  they  give  us  some 

* 

insights  into  the  origin  of  the  systematic  trends.  There  are 
several  important  properties  of  this  origin  which  can  now  be 
deduced  from  the  geodetic  results.  Firstly,  we  notice  that  a 
major  part  of  the  apparent  group  delay  errors  at  S-band  is 
independent  of  the  radio  source  being  observed.  We  may  there¬ 
fore  conclude  that  the  differences  are  most  probably  not 
caused  by  the  brightness  distributions  of  the  radio  sources 
(see  Section  3.3) .  Also,  since  the  radio  telescopes  look  in 
many  different  parts  of  the  sky  in  rapid  succession,  the  major 
portion  of  the  trends  is  probably  not  due  to  antenna 
deformation.  In  Figure  4.4.2,  we  show  the  elevation  angles  at 
Effelsberg  and  Onsala  for  these  observations.  From  this 
figure  we  see  that  observations  are  taken  at  many  different 
elevation  angles  in  a  short  period  of  time.  (In  each  of  these 
cases  some  portion  of  the  error  is  may  be  due  to  these  causes 
(i.e.,  source  structure  and  antenna  deformations),  because  the 
scatter  of  errors  around  smooth  curves  is  greater  than  would 
be  expected  from  the  variances  of  the  observations.) 

The  apparent  group  delay  errors  seem  to  depend  mainly  on 
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ire  4.4.2  Elevation  angles  at  Effelsberg  and  Onsala  for  the  observations 
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time,  and  hence  it  is  likely  that  they  arise  from  the  Mark  III 
system  itself.  What  could  be  the  origin  of  the  errors?  We 
saw  in  Section  3.1  that  the  Mark  III  dispersion  depends  on 
time  and  we  found  lower  bounds  on  the  variations  of  8  psec/day 
at  X-band  and  36  psec/day  at  S-band.  But  we  stressed  in  that 
section  that  these  are  lower  bounds  because  we  cannot  estimate 
the  full  delay  variations  from  the  residual  phases.  At  this 

time  it  seems  that  variations  of  the  Mark  III  dispersion  with 

* 

time  are  the  most  likely  cause  of  the  systematic  variations  of 
the  apparent  group  delays. 

It  is  important  at  this  time  that  we  put  the  systematic 
variations  of  the  S-band  group  delays  into  perspective.  In  a 
geodetic  data  analysis,  the  S-band  group  delays  are  used  only 
to  calculate  the  ionospheric  correction.  Consequently,  the 
contribution  of  their  errors  to  the  non-dispersive  delays  are 
12.5  times  smaller  than  their  errors  and,  hence,  the  threshold 
performance  limit  of  150  psec  at  S-band,  results  in  the  thres¬ 
hold  limit  of  »12  psec  for  non-dispersive  delays.  When  we 
compare  this  limit  with  the  threshold  limits  of  the  X-band 
group  delays  (40  psec  in  July  1980  and  14  psec  in  June  1981)  , 
we  see  that  the  X-band  group  delay  errors  are  of  equal,  or 
more  importance,  than  the  S-band  errors. 


We  have  now  discussed  the  theory  and  the  operational 
performance  of  the  Mark  III  VLBI  system.  While  the  per¬ 
formance  of  the  system  is  not  so  good  as  expected  (see 
Chapters  3  and  4) ,  we  have  not  detected  any  major  errors  in 
the  measurements  made  with  the  system.  We  now  discuss  some  of 

the  geodetic  results  which  have  been  obtained  with  this  system 

* 

and  the  Mark  I  VLBI  system. 

This  chapter  will  consist  of  three  papers  --  two  of  which 
have  already  been  published  and  one  of  which  is  currently 
being  prepared  for  publication.  For  those  readers  unfamilar 
with  the  locations  and  names  of  the  radio  telescopes  which 
have  been  used  in  geodetic  VLBI  experiments,  this  information 
is  given  in  Table  5.1.  The  telescope  locations  are  also  shown 
in  Figure  5.1.  The  IAU  (International  Astronomical  Union) 
convention  and  the  common  names  of  the  radio  sources  discussed 
here  and  elsewhere  in  the  thesis  are  given  in  Table  5.2. 

The  first  paper  we  present,  entitled  "Geodesy  by  radio  f 
interferometry:  Intercontinental  distance  determinations  with 
subdecimeter  precision",  was  published  in  1981  and  discusses 
the  baseline  length  measurements  from  radio  telescopes  in 
North  America  to  a  radio  telescope  in  Onsala,  Sweden,  which 
were  made  with  the  Mark  I  VLBI  system.  When  this  paper  was 
published  we  thought  that  these  length  measurements  were  of 
subdecimeter  precision.  Soon  after  dual  frequency  band 
observations  to  Onsala  were  analysized,  it  became  clear  that 
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Table  5.1  Radio  telescope  descriptions  and  names 


Observatory  Name* 

North  America 
Haystack  HAYSTACK 

Observatory 

Westford  WESTFORD 

Observatory 
Harvard  HRAS  085 

Radio  Astron¬ 
omy  Station 
Owens  Valley  OVRO  130 
Radio  Obser¬ 
vatory 

National  NRAO  140 

Radio  Astron¬ 
omy  Obser¬ 
vatory 


Operated  by 

Northeast  radio 
Observatory  Cor¬ 
poration  (NEROC) 
NEROC 

Harvard  University 

California  Insti¬ 
tute  of  Technology 

National  Radio 
Astronomy  Obser¬ 
vatory  (NRAO) 


Location  Antenna 
_ Diameter 

Westford,  Ma.  37  m 

westford,  Ma.  26  m 

* 

Ft.  Davis,  Tx.  26  m 

Big  Pine,  Ca.  40  m 

Green  Bank,  Wv.  43  m 


Eupope 

Onsala  Space  ONSALA60  Chalmers  Institute  Onsala,  Sweden  20  m 

Observatory  (X-band)  of  Technology 

ONSALA85  26?m 

(S-band) 

Effelsberg  EFLSBERG  Max  Plank  Institut  Bonn,  Federal  100  m 

Observatory  fur  Radio-  Republic  of 

astronomie  Germany 

Chilbolton  CHLBLTON  Appleton  Chilbolton,  26  m 

Observatory  Laboratories  England 

*  These  names  will  be  used  on  the  computer  generated  plots. 


Locations  of  radio  telescopes 
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Table  5.2  IAU  convention*  and  common  names  for  VLB  I  radio 
sources 


Common  name 

IAU  name 

4C  67.05 

0224+671 

NRAO  150 

0355+508 

OJ  287 

0851+202 

4C  39.25 

0923+392 

OK  290 

0953+254 

3C  27  3B 

1226+023 

OQ  208 

1404+286 

NRAO  512 

1638+398 

3C  345 

1641+399 

VR  422201 

2200+420 

3C  454.3 

2251+158 

*The  IAU  (International  Astronomical  Union)  names  are  defined 
in  the  form  of  rrhhtdd.d,  where  rrhh  are  the  hours  and  minutes 
of  the  right  ascension  of  the  source,  and  dd.d  is  the  declina¬ 
tion  of  the  source  in  decimal  degress. 
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we  had  grossly  underestimated  the  effects  of  the  ionosphere 
delay  on  the  single  frequency  band  (X-band)  Mark  I  observa¬ 
tions.  Although,  at  the  time  we  also  thought  that  some  of  the 
differences  (between  the  Mark  I  and  the  Mark  III  baseline 
length  estimates)  could  have  been  due  to  the  change  in  the 
equipment  or  to  the  ionospheric  delay  correction  computed  from 
the  two  antennas  at  Onsala  (see  Section  4.2) . 

We  learned  a  valuable  lesson  on  the  effects  of  un-modeled 

# 

systematic  errors  from  this  experience.  This  lesson  prompted 
the  detailed  analysis  of  the  baseline  length  estimates  which 
are  discussed  in  the  second  paper  in  this  chapter  —  "Geodesy 
by  radio  interferometry:  Precision  and  accuracy  of  inter¬ 
continental  distance  measurements." 

The  third  paper  we  present  discusses  the  determination  of 
the  tidal  parameters  of  the  earth  using  VLBI.  The  paper  is 
entitled  "Determination  of  the  tidal  parameters  of  the  earth." 
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GEODESY  BY  RADIO  INTERFEROMETRY : 

INTERCONTINENTAL  DISTANCE  DETERMINATIONS  WITH  SUBDEC1HETER  PRECISION 

T.  A.  Herring,1  B.  £.  Corey,1  C.  C.  Counselman  III,'  I.  I.  Shapiro,'*’ 

B.  0.  Running,'  0.  E.  H.  Rydbeek,'  T.  A.  Clark,*  R.  J.  Coates,' 

C.  Ha,'  J.  W.  Ryan,'  N.  fi.  Vandenberg,’  H.  F.  Hlnteregger ,* 

C.  A.  Knight,'  A.  E.  E.  Rogers,'  A.  R.  Whitney,' 

D.  S.  Robertson,*  and  B.  R.  Schupler* 


Abstract.  Analysis  of  very-long-baseline 
Interferometer  (VLBI)  observations  yielded  es¬ 
timates  of  the  distances  between  three  radio 
telescopes  in  the  United  States  and  one  in 
Sweden,  with  formal  standard  errors  of  a  few 
centimeters:  Westford,  Hassachusetts-Onsala , 
Sweden:  5, 599, 7 Ik. 66+0. 03m;  Green  Bank,  West 
Virginia-Onsala,  Sweden:  6,319,317.75t0.O3m;  and 
Owens  Valley,  California-Onsala,  Sweden: 

7 , 91k ,  1 31 . 19±0.04m,  where  the  earth-fixed  refer¬ 
ence  points  are  defined  in  each  case  with  re¬ 
spect  to  the  axes  of  the  telescopes.  The  actual 
standard  errors  are  difficult  to  estimate  re¬ 
liably  but  are  probably  not  greater  than  twice 
the  formal  errors. 

1.  Introduction 

Very  long  baseline  interferometry  (VLBI)  has 
been  used  for  the  past  decade  to  determine 
distances  between  radio  telescopes  and  positions 
of  radio  sources.  Improvement  in  the  precision 
of  baseline  length  determination  in  North 
America  over  this  period  has  been  substantial  — 
from  a  precision  of  about  1  m  [Hinteregger  et 
al.,  1972]  to  a  precision  of  under  five  centi¬ 
meters  [Robertson  et  al.,  1979]  for  baselines 
from  about  1000  km  to  Just  under  4000  km  in 
length.  Near  the  middle  of  the  decade,  dis¬ 
tances  between  North  America  and  Europe  were 
determined  with  a  precision  of  about  half  a 
meter  [Robertson,  1975;  see  also  Cannon  et  al., 
1979].  In  thi3  paper  we  describe  more  recent 
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VLBI  determinations,  with  subdecimeter  preci¬ 
sion,  of  these  intercontinental  distances. 

2.  Data  Analysis 

The  data  analyzed  consisted  of  interfero¬ 
metric  group  delays  [Shapiro,  1976]  which  were 
obtained  from  three  sessions  of  observation 
involving  up  to  four  antennas,  as  described  in 
Table  1.  All  observations  were  made  with  the 
Hark  I  VLBI  system  with  a  center  radio  frequency 
of  about  8  GHz.  A  multichannel  bandwidth  syn¬ 
thesis  technique  [Whitney  et  al.,  1976]  was 
used.  Each  recorded  channel  had  the  Mark  I 
bandwidth  of  360  kHz,  whereas  the  synthesized 
bandwidth  was  v100  MHz  in  the  first  session  of 
observations  and  u>300  MHz  in  the  last  two  ses¬ 
sions.  A  hydrogen  maser  frequency  standard  of 
modern,  post-1970,  design  was  used  at  each 
telescope  for  each  session. 

The  Cartesian  coordinate  system  used  in  the 
analysis  was  geocentric  and  earth-fixed  with  the 
Z  axis  parallel  to  the  mean  pole  of  rotation  of 
1900-1905,  as  defined  by  the  International 
Latitude  Service  and  maintained  by  the  Bureau 
International  de  l'Heure  (BIH).  The  X  axis  was 
defined  to  be  perpendicular  to  the  Z  axis  and  in 
the  direction  of  the  Greenwich  meridian.  The  Y 
axis  completed  the  right-handed  triad.  Opera¬ 
tionally,  the  origin  of  this  system  was  defined 
by  the  coordinates  for  the  intersection  of  the 
azimuth  and  elevation  axes  of  the  Haystack  radio 
telescope,  obtained  from  a  combination  of  space- 
craft-tracking  and  VLBI  observations  made  at 
various  sites.  The  orientation  was  defined  *by 
the  BIH  values  for  pole  position  and  UT1  (1968 
system),  with  the  addition  of  fortnightly  and 
monthly  terms  of  small  amplitude  [Woolard, 

1959).  and  by  the  models  for  diurnal  polar 
motion  [McClure,  1973]  and  earth  tides  (based  on 
ephemeris  positions  of  the  moon  and  sun,  on  the 
Love  numbers  h  «  0.61  and  l  *  0.09,  and  on  the 
assunption  of  no  dissipation).  The  small  ef¬ 
fects  of  antenna  deformations  [Carter  et  al., 
1980],  ocean  loading,  plate  tectonics,  and  other 
errors  in  the  model  were  ignored  in  view  of  the 
limited  precision  of  the  measurements  being 
analyzed. 

The  orientation  of  this  earth-fixed  coordi¬ 
nate  system  in  inertial  space  (with  respect  to 
the  mean  equinox  and  equator  of  1950.0),  was 
defined  by  the  pole  position,  by  UT1  and  by  the 
standard  formulas  for  sidereal  time,  nutation, 
and  precession,  with  certain  small  corrections: 
Woolard's  [1953]  theory  as  modified  by  Melchior 
[1971]  was  used  for  nutation;  Bnd  5,027'.'878  per 
tropical  century  at  1950.0  was  taken  for  the 
precession  constant.  The  origin  of  right  ascen¬ 
sion  was  defined  by  the  value  12  hr  26  min 
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Observation 

Session 

Dates 

Duration , 
hours 

Ant  er.rias* 

Number  of 
Sources! 

Declination  Range 
of  Sources, 
deg 

Number  of 
Group-Delay 
Observations** 

1 

Sept 

.  21-25,  1977 

88tt 

H.N.O 

10(6) 

-5.5  to  50.8 

529  (151) 

2 

Feb. 

24-26,  1976 

50 

H.O.V 

6(4) 

0.5  to  50.6 

291  (77) 

3 

May 

17-19,  1978 

45 

h.o.n.v 

10(10) 

-5.5  to  50.8 

830  (258) 

*  H  ;  37-m  diameter  telescope  of  the  Haystack  Observatory,  Westford,  Massachusetts;  N  =  43-m 

diameter  telescope  of  the  National  Radio  Astronomy  Observatory  (NRAO),  Green  Bank,  West  Virginia; 
0  £  20-m  diameter  telescope  of  the  Cnsala  Space  Observatory,  Onsala,  Sweden;  and  V  £  40-m 
diameter  telescope  of  the  Owens  Valley  Radio  Observatory,  Big  Pine,  California, 
t  The  sources  observed  are  listed  lr.  the  caption  to  Figure  1.  The  numbers  of  sources  observed  at 
Onsala  are  given  in  parentheses. 

•*  The  numbers  of  observations  involving  Onsala  are  given  in  parentheses, 
it  Onsala  participated  for  the  last  62  hours. 


33.246  sec  for  the  radio  source  3C  273B  (ellip¬ 
tic  aberration  removed). 

Using  this  framework  and  a  theoretical  model 
for  the  group  delays  [Robertson,  1975;  Ma, 

1976],  we  estimated  by  weighted  least  squares 
the  baseline  vectors,  the  source  positions,  and 
the  other  relevant  parameters.  Included  in  the 
last  category  are  parameters  that  allow  for  the 
estimation  of  changes  in  pole  position  and 
variations  in  UT1  when  data  from  two  or  more 
sessions  of  observations  are  combined  [Shapiro 
et  al.,  197*.  1976;  Robertson  et  al.,  1979).  In 
addition  to  these  purely  geometric  parameters 
there  are  parameters  in  the  model  that  represent 
the  effects  on  the  group  delays  of  the  propaga¬ 
tion  medium  and  of  the  behavior  of  the  clocks  at 
the  various  radio  telescopes.  The  representa¬ 
tion  of  the  propagation  median  Includes  the 
zenith  tropospheric  delay  as  a  parameter.  Since 
the  signal  delay  through  the  troposphere  varies 
with  time  and  with  location,  a  number  of  such 
parameters  are  utilized  with  this  model.  The 
effects  of  the  ionosphere,  always  under  v>1  ns  in 
the  zenith  direction,  but  not  separately  model¬ 
ed,  are  largely  absorbed  by  these  parameters 
because  of  the  similarity  in  the  signature  in 
the  group  delays  of  the  ionosphere  and  the 
troposphere.  The  representation  of  the  behavior 
of  the  clock  at  one  site  relative  to  that  at 
another  consists  of  a  polynomial  in  time  of  low 
order  with  the  coefficients  as  parameters,  the 
first  two  denoting  the  epoch  and  rate  offsets. 
Since  the  clocks  drift  unpredictably  with  time, 
a  number  of  such  polynomials  are  usually  employed. 

The  choice  of  the  number  of  parameters  to 
represent  the  combined  effects  of  the  tropo¬ 
sphere  and  the  ionosphere  and  the  behavior  of 
the  clocks  as  well  as  the  choice  of  the  time 
interval  of  applicability  of  each  parameter  are 
to  a  certain  extent  subjective.  We  therefore 
investigated  the  effects  on  the  results  of  a 
variety  of  such  choices.  As  extremes  for  the 
representation  of  the  troposphere  and  the  iono¬ 
sphere,  we  considered  for  each  telescope  the  use 
of  one  parameter  for  the  zenith  delay  for  each 
session  of  observations  and  one  such  parameter 
for  each  12-hour  period  of  observations.  For 
the  clock  behavior  we  chose  the  clock  at 


Haystack  as  the  reference  and  considered  for 
each  other  telescope  for  each  session  the  use  of 
one  parameter  each  for  the  relative  offsets  In 
epoch,  rate,  and  (In  several  instances)  rate  of 
change  of  rate.  The  intervals  of  applicability 
of  such  'clock  polynomials'  ranged  from  7.5  to 
24  hours;  only  for  this  latter  case  was  a  para¬ 
meter  Included  for  the  rate  of  change  of  rate. 
The  variations  in  the  estimates  of  baseline 
lengths  resulting  from  these  different  parame¬ 
ter!  zations  were  up  to  1.B  and  1.2  times  the 
corresponding  standard  deviation  obtained  for 
the  adopted  parameterization  for  the  data  from 
each  of  the  first  two  sessions  and  from  the 
third  session,  respectively.  In  view  of  this 
stability,  we  deem  it  unlikely  that  either  the 
propagation  medium  or  the  behavior  of  the  clocks 
has  Introduced  errors  in  baseline  length  much  in 
excess  of  the  formal  standard  deviations.  The 
adopted  choice  of  epochs  and  intervals  of 
applicability  for  these  parameters  (see  section 
4)  was  based  primarily  on  examination  of  the 
postfit  residuals,  especially  those  from  obser¬ 
vations  at  low  elevation  angles  for  the  para¬ 
meters  representing  the  behavior  of  the  propa¬ 
gation  medium. 

The  inverse  of  the  weight  for  each  group 
delay  was  obtained  from  the  sum  of  the  variance 
found  from  a  signal-to-noise  analysis  [Whitney, 
1974]  and  an  ad  hoc  term  Included  to  account  for 
error  sources  that  were  not  a  function  of  signal 
strength.  The  inclusion  of  this  term  was  moti¬ 
vated  by  the  desire  to  (1)  allow  for  the  effects 
of  systematic  errors,  and,  relatedly,  (2)  weight 
the  observations  of  the  different  sources  more 
evenly,  since  for  some  strong  sources  the 
signal-to-noise  analysis  yielded  uncertainties 
that  were  well  below  the  suspected  contributions 
of  other  errors.  The  magnitude  of  this  ad  hoc 
term  was  assumed  constant  for  the  data  for  a 
given  baseline  for  a  given  session  of  obser¬ 
vations.  Its  value  was  obtained  from  the  con¬ 
straint  that  chi-square  per  degree  of  freedom  be 
unity,  and  its  square  root  ranged  from  0.10  to 
0.25  ns.  Omitting  this  ad  hoc  term  from  the 
variances  changed  the  estimates  of  baseline 
length  by  up  to  1.2  times  the  standard  devia¬ 
tions  obtained  with  this  term  Included. 
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3.  Results 

Table  2  gives  our  estimates  of  the  distances 
between  the  radio  telescope  in  Sweden  and  the 
three  in  the  United  States.  The  reference  point 
for  each  telescope  Is  the  Intersection  of  the 
azimuth  and  elevation  axes,  except  for  the  NRA 0 
telescope  where  the  reference  is  the  point  on 
the  polar  axis  closest  to  the  (nonintersecting) 
equatorial  axis  (Hlnteregger  et  al..  1972]. 

Typical  postflt  residuals  from  the  simultan¬ 
eous  analysis  of  the  data  from  all  sessions  are 
shown  in  Figure  1.  The  root  mean  square  of 
these  residuals  ranged  from  0.13  ns  for  the 
Haystack-NRAO  data  from  the  last  session  to  0.49 
ns  for  the  Onsala-HRAO  data  from  the  first 
session,  the  ratio  being  approximately  as  ex¬ 
pected  in  view  of  (1)  the  threefold  smaller 
synthesized  bandwidth  used  In  the  first  session 
of  observations,  (2)  the  generally  larger  corre¬ 
lated  flux  densities  obtained  from  observations 
Involving  the  shorter  baselines,  and  (3)  the 
slightly  higher  sensitivity  of  the  Haystack-NRAO 
system  compared  to  the  Onspla-NRAO  system. 

The  repeatability  of  the  estimates  of  the 
baseline  lengths  shown  in  Table  2  Indicates  that 
the  formal  standard  errors  may  not  be  much 
smaller  than  the  actual  errors.  Howpver,  one 
fact  detracts  from  the  Importance  of  this  indi¬ 
cation:  The  uncertainties  of  the  estimates  of 
baseline  lengths  from  the  third  session  of 
observations  were  much  smaller  than  those  from 
the  second  which,  in  turn,  were  smaller  than 
those  from  the  first.  The  disparity  between  the 
uncertainties  from  the  first  and  second  sessions 
was  due  primarily  to  the  contrast  in  synthesized 
bandwldths.  The  further  discrepancy  between  the 
uncertainties  from  the  first  two  sessions  and 
those  from  the  third  was  due  mainly  to  geometry; 
the  schedule  of  observations  for  the  first  two 
sessions  was  controlled  by  requirements  of  other 
experiments  involving  only  the  U.S.  antennas  and 
therefore  did  not  properly  provide  for  Onsala's 
participation. 

The  estimates  for  the  propagation  delay 
through  the  atmosphere  and  ionosphere  In  the 
zenith  direction  ranged  from  6.7  ns  (NRAO)  to 
6.2  ns  (Onsala):  the  estimates  for  the  clock 
parameters  were  also  each  In  approximate  accord 
with  expectations,  ranging  up  to  23  ti»  for  epoch 
offsets  and  from  ^xIO'1*  to  *2x10"'*  for  rate 
offsets. 

The  estimates  for  the  source  positions,  the 
distances  between  the  radio  telescopes  In  the 
United  States,  and  the  pole  position  and  UT1 
variations  will  be  given  elsewhere  and  discussed 
there  along  with  a  much  larger  collection  of 
VLBI  observations  that  involved  only  the  tele¬ 
scopes  in  the  United  States. 

4.  Conclusions 

Combination  of  data  from  three  sessions  of 
VLBI  observations  Involving  sites  in  Sweden  and 
the  United  States  yielded  results  of  subdeci¬ 
meter  precision  In  the  determination  of  the 
intercontinental  distances.  Since  sufficiently 
accurate  and  well-distributed  observations  to 
determine  baselines  to  Sweden  with  subdecimeter 
precision  were  available  for  only  one  session, 
the  data  are  not  useful  for  the  establishment  of 
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Fig.  1.  Postfit  residuals  from  the  adopted  sol¬ 
ution  see  Table  2)  for  the  Haystack-Onsala 
group-delay  measurements  from  session  3  (see 
Table  1).  The  root  mean  square  of  these  resi¬ 
duals  is  0.25  ns.  The  vertical  bars  represent 
±1  standard  deviation.  Each  letter  in  the  fig¬ 
ure  denotes  a  different  radio  source:  A  =  NRAO 
150;  B  nc  81;  c  =  DC  39.25;  D  s  3 C  273B;  E  s 
3C  395;  F  =  VRO  92.22.01;  G  s  2139»00;  H  i  3C 
959.3;  and  I  5  OJ  287.  One  other  source,  ob¬ 
served  on  some  baselines  in  session  3.  was  3C 
279.  See  Clark  et  al.  [1976]  for  approximate 
values  of  the  coordinates  for  each  source. 

subdecimeter  repeatability.  Neither  are  the 
earlier  VLB1  determinations  [Robertson,  1975] 
useful  since  they  were  less  precise,  and,  in 
addition,  involved  a  different  telescope  at 
Onsala  which  has  yet  to  be  located  accurately 
with  respect  to  the  one  we  utilized  (Table  1). 
Further,  to  assess  inherent  accuracy,  compar¬ 
isons  would  best  be  made  with  results  from  a 
more  accurate  technique.  None  now  exists.  The 
most  accurate  alternate  technique  currently 
available  for  the  determination  of  these  parti¬ 
cular  Intercontinental  baselines  is  based  on 
observations  of  the  Doppler  shift  of  radio 
signals  transmitted  from  satellites,  plus  stan¬ 
dard  ground  surveys  to  tie  the  reference  points 
for  the  Doppler  measurements  to  those  for  the 
VIB1  determinations.  Such  Doppler  data  exist 
(W.  E.  Carter,  private  communication,  1979),  but 
the  analysis  and  the  ground  surveys  have  not 
been  completed  and  so  no  comparisons  can  yet  be 
made. 

Periodic  repetitions  of  these  intercontinen¬ 
tal  VIBI  measurements  are  also  planned  with  more 
radio  telescopes  in  the  interferometer  array  and 
with  use  of  our  new,  more  powerful,  Mark  III 
VLBI  system.  This  system,  among  other  attri¬ 
butes  records  data  simultaneously  from  two  well 
separated  radio  frequency  bands,  thus  allowing 
ionospheric  effects  to  be  virtually  eliminated 
as  a  source  of  error.  The  purpose  of  these 
further  VLBI  measurements  will  not  only  be  to 
check  on  the  repeatability  of  the  presort  base¬ 
line  determinations  but,  more  importantly,  to 
improve  the  precision  to  the  level  at  which  the 
expected  [Minster  and  Jordan,  1978]  *1.7  cm/yr 
spreading  rate  between  Europe  and  the  United 


Ctat-s  could  be  detected  reliably  within  a 
dec  ale . 
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Abstract 

The  precision  and  accuracy  of  intercontinental  baseline  length 
estimates  are  studied  through  a  number  of  repeatability  tests  on 
55  Mark  III  very-long-baseline  interferometry  (VLBI)  observing 
sessions  carried  out  between  November  1979  and  January  1982.  We 
also  present  results  from  the  analysis  of  a  further  61  VLBI  observing 
sessions  carried  out  between  January  1982  and  June  1983.  The 
estimate  of  the  Westford,  Massachusetts  to  Onsala,  Sweden  baseline 
length  obtained  from  the  ensemble  of  observing  sessions  has  a 
statistical  standard  deviation  of  0.5  cm.  The  weighted-root-mean-square 

(WRMS)  scatter  of  the  estimates  from  the  40  individual  observing 

r 

sessions  involving  these  sites  (each  of  approximately  24  hr  duration) , 
about  the  ensemble  mean  is  2.6  cm.  Studies  of  these  solutions 
indicate  that  the  actual  standard  deviation  of  the  baseline  length 
estimate  is  probably  more  in  accord  with  the  repeatability  rather 
than  with  the  statistical  formal  estimate.  The  estimate  of  the 
rate  of  change  of  the  length  of  this  baseline  is  1.6  ±  0.5  cm/yr, 
where  the  standard  deviation  of  the  rate  is  computed  from  the  formal 
uncertainty  of  the  rate  scaled  by  the  ratio  of  the  scatter  of  the 
baseline  estimates  to  the  expected  scatter  computed  from  the  standard 
deviations  of  the  length  estimates.  The  WRMS  scatter  of  the  40 
baseline  length  estimates  about  this  "best-fit"  straight  line  was 
2.0  cm.  Although  the  rate  of  change  of  the  baseline  length  is  over 
three  standard  deviations,  we  do  not  yet  claim  to  have  detected 
the  relative  motions  of  these  two  sites  because  our  studies  indicate 
that  the  VLBI  estimates  of  the  baseline  lengths  could  be  affected 
by  systematic  errors  at  the  level  of  the  apparent  change  in  the 
length. 

A  unexpected  conclusion  from  these  studies  is  that  the  precision 
of  the  baseline-length  estimates  from  solutions  which  incorporate 
low-elevation-angle  observations  (as  low  as  3?5)  is  better,  in 
general,  than  the  estimates  from  solutions  which  do  not  include 
observations  at  elevation  angles  below  10°. 


The  Mark  III  very-long-baseline  interferometry  (VLBI)  system 
was  introduced  in  April  1979  (Rogers  et  al..  1983) .  This  system 
was  designed  to  utilize  two  widely  separated  frequency  bands,  one 
each  near  2.3  GHz  and  8.3  GHz ,  so  that  the  dispersive  delay  due  to 
the  Earth's  ionosphere  could  be  calibrated  and,  hence,  eliminated 
as  a  major  error  source  in  the  estimation  of  the  distances  between 
the  radio  telescopes  that  form  the  interferometers.  The  first 
intercontinental  observing  session  to  partially  use  dual-frequency-band 
data  was  carried  out  in  November  1979,  but  only  six  hours  of  such 
data  were  obtained.  In  July  1980,  the  first  full  dual-frequency-band 
observing  session,  involving  North  American  (N. A. )  and  European 
sites,  was  carried  out.  Between  then  and  January  1983,  over  30,000 
dual-frequency  observations  have  been  made  in  observing  sessions 
sponsored  by  the  NASA  (National  Aeronautics  and  Space  Administration) 
Crustal  Dynamics  Project  and  the  National  Geodetic  Survey  (NGS) 
Polaris  program  (Polar-motion  Analysis  by  £adio  Interferometric 
Surveying) .  In  this  paper  we  discuss  the  analysis  of  VLBI  data 
obtained  between  November  1979  and  June  1983  (  36,000  observations). 
This  data  set  includes  41  observing  sessions  involving  both  N. A. 
and  European  sites,  with  40  of  these  observing  sessions  involving 
Onsala,  Sweden. 

This  paper  is  the  first  of  a  series  which  will  discuss  the 
quality  of  the  data,  and  the  precision  and  accuracy  of  the  geodetic 
parameters  which  can  be  estimated  from  these  data.  Here  we  address 
the  question  of  the  precision  and  accuracy  of  the  estimates  of  tha 
baseline  lengths  between  the  radio  telescopes  participating  in 
these  observing  sessions. 

The  "true"  accuracy  of  the  baseline-length  estimates  cannot 

be  determined  reliably  without  a  more  accurate  standard  to  which 

the  VLBI  results  may  be  compared.  No  such  standard  exists  which 

will  allow  accuracy  to  be  checked  at  the  necessary  level  of  one 
g 

part  in  10  over  intercontinental  distances  (corresponding  to  =  0.5 
cm  for  a  baseline  between  radio  telescopes  located  in  Massachusetts 


and  Sweden).  Without  such  a  standard,  we  can  still  address  the 
accuracy  question  by  investigating  the  variations  in  the  baseline-length 
estimates  with  changes  in  the  data  set  or  with  changes  in 
parameterization  of  the  models  used  to  analyze  the  data.  These 
techniques,  will,  however,  only  place  a  lower  bound  on  the  estimates 
of  the  accuracy  of  the  results. 

The  main  purpose  of  the  Crustal  Dynamics  geodetic  VLBI 
experiments  involving  sites  in  North  America  and  Europe  is  measure 
the  contemporary  rate  of  motion  between  these  continents,  or  to 
place  a  useful  upper  bound  on  this  rate.  Measurement  precision  is 
more  important  for  this  purpose  than  accuracy.  But  an  understanding 
of  accuracy  is  required  to  identify  sources  of  error  that  may  changl 
over  the  time  interval  used  to  estimate  the  rate,  in  our  case 
several  years. 

The  question  of  the  accuracy  of  VLBI  baseline-length  estimates 
involves  many  facets  of  the  technique.  We  limit  discussion  here 
to  the  major  error  source:  the  effects  of  the  neutral  atmosphere, 
or  more  precisely,  the  methods  we  have  adopted  to  study  these 
effects.  We  first  discuss  the  mathematical  and  statistical  models 
which  were  used  to  analyze  the  observations,  and  then  the  effects 
on  the  baseline-length  estimates  of  (i)  limiting  the  minimum 
elevation  angle  of  observations  used  in  the  analysis;  (ii)  using 
surface  weather  data  to  calibrate  the  neutral  atmosphere  delay; 
and  (iii)  estimating  parameters  that  relate  elevation  angle  to  air 
mass  (see  below) .  We  also  compare  our  results  for  the  Haystack — 
to-Onsala  baseline  length  with  values  previously  published.  * 


The  data  analyzed  consisted  of  dual-frequency-band  interferometric 
group  delays  (Shapiro,  1976;  Rogers  et  al..  1983),  obtained  in  123 
observing  sessions  between  November  1979  and  June  1983.  (Each  of 
these  sessions  was  of  approximately  24  hr  duration.)  The  frequency 
bands  used  were  centered  at  8.3  GHz  (X-band)  and  2.3  GHz  (S-band)  . 
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Within  each  frequency  band,  data  were  obtained  in  four  to  eight 
channels,  each  of  2  MHz  bandwidth.  The  total  bandwidth  spanned  by 
these  channels  varied  between  300  and  360  MHz  at  X-band,  and  50 
and  85  MHz  at  S-band,  thus  allowing  accurate  estimates  of  the  group 
delays  (Whitney  et  al . .  1976) . 

These  observations  were  analyzed  within  the  framework  described 
in  Herring  et  al .  (1981).  However,  the  following  models  were  used 
in  place  of  their  counterparts  described  there:  (i)  The  1AU-1980 
nutation  series  (Wahr,  1981);  (ii)  the  Greenwich  sidereal  time 
formula  of  Aoki  et  al.  (1982);  and  (iii)  the  atomic  time-coordinate 
time  relationship  that  includes  relativistic  corrections  (Shapiro, 
et  al..  1983) .  This  new  relativistic  correction  accounts  for  th£ 
change  in  the  light  propagation  time  in  the  potential  fields  of 
the  Earth  and  the  Sun.  In  addition,  we  added  corrections  for  the 
ionospheric  propagation  delay,  based  on  the  dual-frequency-band 
observations,  and  for  the  neutral  atmospheric  propagation  delay, 
based  on  surface  weather  data  (Marini,  1975). 

We  assumed  that  the  observation  noise  was  an  independent,  zero 

mean,  Gaussian  random  process.  The  variance  of  the  noise  for  each 

observation  was  computed  as  the  sum  of  two  variances.  The  first 

variance  was  computed  from  the  signal-to-noise  ratio  (SNR)  of  the 

observation  and  represents  the  uncertainty  in  the  observation  due 

predominantly  to  instrumental  noise.  The  second  variance  was 

2 

calculated  from  the  condition  that  the  x  /degree  of  freedom  of  the 
postfit  residuals  for  each  baseline  be  unity  (see  below)  .  This 
variance  is  used  to  account  for  the  pseudo-noise  contribution  fron* 
the  inadequate  models  used  to  represent  the  data. 


The  observations  were  analyzed  using  a  sequential,  weighted- 
least-squares  technique  (see,  for  examples,  Kaula,  1966;  Morrison, 
1969) ,  with  the  weights  being  taken  as  the  inverses  of  the  variances 
of  the  noise  contributions  to  the  observations.  Unless  otherwise 


stated/  the  following  classes  of  parameters  were  estimated  in  the 
solutions  to  be  discussed  below:  (i)  Coefficients  of  one  to  three 
polynomials  in  timef  each  of  first  or  second  order,  and  in  some 
cases,  coefficients  of  diurnal  sine  and  cosine  functions.  These 
polynomial  and  trigonometric  functions  were  used  to  represent  the 
relative  behavior  of  the  hydrogen  maser  "clocks"  at  the  different 
sites;  (ii)  a  daily  bias  correction  to  the  neutral  atmosphere 
propagation  delay  calculated  from  surface  weather  data;  (iii)  all 
crust-fixed  site  coordinates  (assumed  time  independent)  ,  except 
for  those  of  one  site  which  were  used  to  define  the  origin  of  the 
coordinate  system  (see  Herring  et  al..  1981);  (iv)  All  radio  source 
positions,  save  for  one  parameter  —  the  mean  correction  to  the 
right  ascensions  of  seven  radio  sources  which  served  as  the  right 
ascension  origin.  (The  seven  radio  sources  chosen  were  0851+202, 
0923+392,  1226+023,  1641+399,  2134+004,  2200+420,  and  2251+158); 
(v)  daily  corrections  to  the  pole  position  and  UT1  given  by  BIH 
Circular  D;  (vi)  daily  corrections  to  the  nutation  angles  given  by 
the  IAU-1980  nutation  series  (to  be  discussed  separately  in  Herring 
et  al . .  1983)  . 

In  addition  to  these  parameters,  we  sometimes  estimated  a 
correction  to  the  relationship  between  the  elevation  angle  of  the 
measurement  and  the  air  mass  through  which  the  signal  passed  (this 
relationship  will  be  called  the  "mapping  function") .  We  parameterized 
the  mapping  function  given  by  Chao  (1972) ,  by  introducing  a  parameter 
corresponding  to  a  fractional  correction  to  one  constant  in  his 
empirical  formula:  * 

te  =  t2/[sin(e)  +  (1+a) r^/ (tan (e) +r2) J  ,  (1) 

where  tfi  is  the  atmospheric  delay  for  an  observation  at  an  elevation 
angle,  &,  when  the  zenith  delay  is  tz,  and  ^=0.00143  and  1-2=0.0445 
are  two  constants  estimated  (by  Chao)  by  relating  his  empirical 
model  to  a  ray-trace  model,  and  a  is  the  parameter  we  introduced. 

VLBI  data  analysis  also  involves  a  number  of  subjective  aspects: 


(i)  the  parameterization  of  the  relative  behaviors  of  the  hydrogen 
masers  at  the  radio  telescopes,  (ii)  the  parameterization  of  the 
atmospheric  delay  behavior  at  each  site,  (iii)  data  editing,  and 
(iv)  data  reweighting  (see  discussion  below).  We  have  already 
discussed  the  the  models  which  are  used  to  parameterize  the  relative 
clock  behaviors  and  the  atmospheric  delays.  The  polynomial  order 
and  the  epochs  of  the  boundaries  between  estimates  of  the  polynomial 
coefficients  are  decided  on  by  the  analysts  after  a  visual  inspection 
of  the  postfit  residuals  from  a  solution  which  utilizes  relatively 
simple  parameter izations  of  the  clock  behavior.  Trends  in  the 
residuals  which  are  radio-source  independent  are  usually  associated 
with  clock  variations.  Similarly,  trends  which  are  elevation-angle 
dependent  are  usually  associated  with  atmospheric  variations.  In 
our  analysis  a  single  atmospheric  parameter  was  estimated  each  day 
at  each  site  and  served  to  remove  any  constant  biases  in  the  surface 
weather  model  calibration  of  the  atmospheric  delays.  Data  editing, 
i.e.  the  elimination  of  observations  which  appear  to  be  defective, 
is  also  subjective  in  nature.  However,  an  examination  of  the 
observations  which  in  the  past  have  been  defective,  the  reason  for 
the  defect  can  often  be  established,  thus  allowing  future  observations 
to  be  checked  for  this  condition.  Consequently,  as  the  the  use 
the  Mark  III  has  increased,  a  series  of  quality  checks  has  been 
developed  which  allow  defective  observations  to  be  detected  before 
the  analyst  examines  the  data.  These  checks  have  eliminated  many 
of  the  subjective  decisions  associated  with  data  editing.  We 
discuss  shortly  the  techniques  which  we  used  to  ensure  that  tf 
uniform  editing  criterion  was  used  throughout  our  analysis.  Data 
reweighting  is  the  final  subjective  aspect  of  the  data  analysis  we 
need  to  discuss.  The  scatter  of  the  postfit  residuals  from  a  Mark 
III  least  squares  solution  is  nearly  always  greater  than  the  standard 
deviations  of  the  observations  would  indicate.  This  greater  scatter 
occurs  because  the  observation  standard  deviations,  which  are 
calculated  from  the  signal-to-noise  ratio  (SNR)  of  the  observations, 
do  not  take  into  account  the  errors  in  the  mathematical  models  used 


to  analyze  the  data  and  in  the  random  variations  of  the  frequency 
standards.  The  contribution  of  these  additional  uncertainties  can 
be  accounted  for  by  adding  a  constant  variance  to  the  SNR-calculated 
variances  and  by  using  this  new  variance  to  calculate  the  weights 
used  in  the  weighted  least-squares  solution.  This  operation  is 
call  "reweighting.”  The  baseline-dependent  variance  is  computed 
from  postfit  residuals  and,  hence,  depends  on  the  solution  used  to 
generate  these  postfit  residuals.  We  discuss  below  the  techniques 
we  used  to  determine  these  baseline-dependent  variances. 

Our  analysis  was  carried  out  in  several  stages  with  the  aim 
of  generating  a  data  set  which  was  both  uniform  in  its  editing  and 
reweighting  criteria.  We  firstly  carried  out  a  solution  using  alf 
of  the  observing  sessions.  We  then  reanalyzed  each  session  separately 
with  the  radio-telescope  and  radio-source  positions  constrained  to 
the  values  obtained  from  the  analysis  of  all  of  the  data.  We 
rejected  all  observations  which  had  postfit  residuals  greater  than 
three  (reweighted)  standard  deviations.  Using  this  edited  data 
set,  we  calculated  baseline-dependent  variances  which,  when  added 
to  the  SNR  variances  of  the  observations,  would  make  the 
degree  of  freedom  of  the  postfit  residuals  for  each  interferometry 
approximately  unity.  Since  both  the  editing  and  the  reweighting 
depend  on  the  the  station  and  source  positions  used  in  the  analysis 
of  each  session,  we  repeated  the  above  procedure  three  times  to 
achieve  convergence.  The  baseline-length  estimates  changed  by  up 
to  3.6  standard  deviations  (third  iteration,  formal)  during  these 
iterations.  (The  largest  change  was  3.6  cm  for  the  Owens  Valle/ 
to  Onsala  baseline.)  The  average  change  of  the  baseline-length 
estimates  was  1.5  standard  deviations  (corresponding  to  approximately 
1-2  cm;  see  Table  1) . 

The  final  data  set  used  to  study  the  accuracy  of  the  baseline 
length  estimates  consisted  of  22,892  group-delay  observations  of 
which  15,928  were  used  in  the  sequential  weighted  least  squares 
solution.  The  data  which  were  not  used  either  failed  quality  checks 
during  correlation  (5,987  observations),  or  were  rejected  by  the 
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editing  criterion  discussed  above  (977  observations)  .  The  large 
number  of  observations  rejected  by  the  "three-sigma*  criterion  is 
probably  due  to  instrumental  errors  which  were  not  detected  during 
correlation  (especially  in  the  early  observing  sessions  before  the 
development  of  an  extensive  set  of  correlator  checks) ,  and  to  large 
postfit  residuals  caused  by  inadequate  models.  In  addition  to  this 
data  set,  we  also  analyzed  a  further  8,688  observations  obtained 
in  observing  sessions  carried  out  between  January  1982  and  June 
1983,  which  were  not  available  when  the  accuracy  studies  were 
carried  out.  The  combined  data  set  was  used  to  study  the  precision 
of  the  baseline  length  estimates. 

&£S  lilts 
The  data  set 

The  data  set  of  nearly  16,000  group  delay  observations, 
described  in  the  previous  section,  was  used  to  study  the  effects 
of  the  neutral  atmosphere  on  the  estimates  of  baseline  length. 
The  specific  tests  we  performed  were:  (i)  limiting  the  data  set 
to  observations  whose  elevation  angles  were  above  a  minimum  value 
called  the  "elevation-angle  cutoff";  (ii)  comparing  solutions  with 
and  without  a  surface  weather  model  prediction  of  the  neutral 
atmosphere  delay;  and  (iii)  comparing  the  solutions  with  and  without 
estimation  of  a  correction  to  the  atmospheric  mapping  function. 

The  result  of  limiting  the  data  set  to  observations  whose 
elevation  angles  were  greater  than  the  elevation  cutoff  are  summarized 
in  Figure  1.  (A  zero  degree  elevation-angle  cutoff  simply  means 
that  all  available  data  were  used  in  the  solution.  The  actual 
minimum  elevation  angle  of  any  observation  in  the  data  set  was 
3?5.)  The  error  bars  shown  in  Figure  1  are  the  standard  deviations 
of  the  differences  in  the  baseline-length  estimates  (since  the 
solution  with  the  lower  elevation  cutoffs  contain  data  which  were 
not  used  in  those  with  higher  elevation-angle  cutoffs,  there  should 
be  differences  between  the  solutions  consistent  with  these  standard 
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deviations) .  (It  is  easily  shown  that  the  variance  of  the  difference 
is  the  difference  of  the  variances  of  the  two  solutions,  in  the 
sense  of  the  larger  variance  (solution  with  the  higher  elevation- angle 
cutoff)  minus  the  smaller  variance  (the  solution  with  the  lower 
elevation-angle  cutoff).) 

Figure  1  shows  that  the  changes  in  the  baseline-length  estimates 
when  the  elevation-angle  cutoff  is  changed,  are  not  in  accord  with 
the  statistics  of  the  difference.  Systematic  differences  of  up  to 
1  cm  exist  between  the  0°  and  10°  cutoff  solutions.  The  differences 
between  the  solutions  with  the  10°  cutoff  and  those  with  the  other 
cutoffs  do  not  show  such  strong  systematic  trends,  but  still  show 
scatter  larger  than  the  appropriate  standard  deviations  woul<§ 
indicate.  In  particular  the  differences  between  the  solution  with 
10°  and  20°  cutoffs  show  large  differences  (between  3  and  5  cm) 
for  baselines  longer  than  8,000  km.  These  large  differences,  and 
their  associated  large  uncertainties,  could  be  due  to  geometric 
degeneracy.  (Many  of  the  low-elevation-angle  observations  in  this 
data  set  were  included  to  strengthen  the  geometry  of  these  very 
long  baselines.) 

To  further  investigate  the  origin  of  the  systematic  differences 

between  the  solutions  using  different  elevation-angle  cutoffs,  we 

compiled  the  statistics  of  the  postfit  residuals  from  these  solutions. 

We  accumulated  the  statistics  in  a  matrix  whose  row  and  column 

indices  were  defined  by  the  air  masses  (calculated  from  the  elevation 

angles)  at  the  two  sites  for  each  baseline.  The  air  mass  bins  were 

0.5  air  masses  wide  and  ranged  between  1.0-1. 5  and  14.0-14.5  ait* 

masses  (the  upper  limit  corresponds  to  an  elevation  angle  of  3?1) . 

We  present  in  Figure  2,  for  the  0°  cutoff  and  the  10°  cutoff 

solutions,  the  statistics  of  the  column  of  this  matrix  for  which 

the  air  mass  bin  was  1.0-1. 5  at  one  end  of  the  baseline.  We  show 

the  weighted  mean  and  the  normalized- root-mean  square  (NRMS)  scatter 

of  the  residuals  in  each  bin.  (The  NRMS  scatter  is  the  square  root 
2 

of  the  x  /degree  of  freedom  for  the  residuals  in  the  bin.  We  plot 
this  quantity,  rather  than  the  x  /degree  of  freedom,  to  decrease 
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the  dynamic  range  of  the  plots.)  The  error  bars  shown  for  the 
weighted  means  are  one  standard  deviation  estimates,  based  on  the 
reweighted  standard  deviations  of  the  observations.  The  95% 
confidence  intervals  shown  for  the  NRMS  scatter  are  also  based  on 
the  reweighted  standard  deviations. 

It  is  clear  that  the  NRMS  scatters  show  a  dependence  on  the 
air  mass  in  both  the  10°  and  0°  cutoff  solutions.  This  elevation 
angle  dependence  is  not  accounted  for  in  the  observation  standard 
deviations,  i.e.,  the  low  elevation  angle  (high  air  mass)  observations 
are  overweighted  weighted  in  these  analyses.  The  effects  of  this 
higher  weighting  can  be  seen  in  the  weighted  mean  results.  The 
positive  bias  of  the  weighted  means  in  air  mass  bins  below  10* 
elevation  angle  (as  seen  in  the  10°  elevation  angle  cutoff  solution) 
are  considerably  reduced  in  the  0°  cutoff  solution.  This  reduction 
in  the  weighted  means  implies  that  thi6  systematic  bias  has  been 
absorbed  into  the  parameter  estimates. 

To  test  the  hypothesis  that  the  relationship  between  air  mass 
and  elevation  angle  could  be  systematically  biased,  we  carried  out 
a  solution  using  a  0°  elevation  angle  cutoff,  but  with  a  site-  and 
observing-session  dependent  correction  to  the  mapping  function 
estimated.  The  mapping  function  correction  was  estimated  at  a  site 
only  when  there  were  observations  with  elevations  angles  less  than 
10°.  We  show  the  difference  between  this  solution  and  the  solution 
with  the  10°  cutoff  (without  a  correction  to  the  mapping  function 
estimated)  in  Figure  3.  The  error  bars  shown  on  the  plot  are  the 
same  as  those  in  Figure  l.a.  Figure  3  does  not  show  the  systematic 
trends  of  Figure  l.a  indicating  that  an  error  in  the  mapping  function 
could  be  partially  responsible  for  the  differences  between  the 
solutions  that  use  different  elevation  angle  cutoffs.  While 
estimating  corrections  to  the  mapping  function  6eems  to  remove  the 
systematic  difference  between  the  solutions  with  different  elevation 
angle  cutoffs,  the  estimated  values  of  a  (see  Equation  1)  have 
large  scatter  with  weighted  mean  values  of  only  a  few  percent. 
(The  correction  to  r^  at  Onsala  was  4%  with  a  RMS  scatter  of  11%. ) 


However,  the  magnitude  of  the  weighted  mean  values  are  consistent 
with  the  elevation  angle  dependence  of  the  postfit  residual  mean 
values  shown  in  Figure  2.  These  studies  of  the  atmospheric  mapping 
function  are  very  preliminary  and  certainly  need  to  be  continued. 
We  could,  for  example,  assume  that  the  correction  is  site  and 
observing  session  independent  and  estimate  a  single  value  for  the 
correction  from  all  of  the  data.  (Our  software  would  need  some 
changes  to  allow  this  estimation.)  An  alternative  approach  would 
be  to  carry  out  specialized  observing  sessions  in  which  a  combination 
of  very  low  elevation  and  high  elevation  angle  observations  were 
made  in  rapid  succession.  These  observing  sessions  at  a  number  of 
sites  and  in  different  seasons  could  allow  accurate  estimation  of 
corrections  to  the  mapping  function. 

We  further  investigated  the  effects  of  our  atmospheric  delay 
models  by  repeating  the  solution  with  the  10°  elevation  angle  cutoff 
without  the  surface  weather  model  prediction  of  the  atmospheric 
delay.  (In  both  solutions  we  estimated  zenith  delay  corrections 
once  daily  at  each  site.)  The  differences  between  the  baseline-length 
estimates  from  these  two  solutions  are  shown  in  Figure  4.  Again 
we  see  a  bias  between  the  baseline-length  estimates  which  increases 
with  baseline  length.  (In  this  case  the  error  bars  shown  are  the 
standard  deviations  of  the  baseline-length  estimates.)  These  results 
indicate  that  care  should  be  taken  in  combining  data  sets  with  and 
without  surface  weather  model  calibration  of  the  atmospheric  delay. 

In  Table  1  we  give  the  baseline-length  estimates  obtained  from 
the  solution  with  the  10°  cutoff  with  surface  weather  model* 
calibration  of  the  atmospheric  delay  We  also  give  the  WRMS  and  NRMS 
scatters  of  the  baseline-length  estimates,  about  these  values, 
obtained  from  a  solution  in  which  the  source  positions  were 
constrained  to  the  values  obtained  from  the  ensemble  of  data  and 
the  station  positions  were  estimated  separately  for  each  observing 
session.  We  also  tabulate  the  WRMS  and  NRMS  scatters  of  baseline-length 
estimates  from  same  type  of  analysis  with  the  0°  cutoff.  These 
scatters  highlight  why  we  believe  low  elevation  angle  observations 
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should  be  studied.  In  nearly  all  cases  the  WRMS  scatter  of  the 
baseline-length  estimates  from  the  solution  with  the  0°  cutoff  are 
less  than  those  from  the  solution  with  the  10°  cutoff,  indicating 
that  the  solution  with  the  0°  cutoff  is  more  precise  than  that  with 
the  10°  cutoff.  In  both  cases  the  normalized  RMS  scatters  (which 
should  be  unity  if  the  scatter  of  the  results  were  in  accord  with 
their  statistics),  generally  range  between  1.5  and  2.0.  These 
values  indicate  that  the  uncertainties  of  the  baselines-length 
estimates  are  "optimistic”  by  a  factor  of  between  1.5  and  2.0. 
The  final  entry  in  Table  1  is  the  contribution  of  the  new  relativistic 
correction  to  the  baseline  length.  (We  include  these  values  so 
that  our  results  may  be  compared  with  previously  published  values 
of  these  baseline  lengths.)  This  contribution  has  an  average  value 

Q 

of  -14  parts  in  10  (Shapiro  et  al..  1984). 

We  show  the  estimates  of  the  Westford-to-Onsala  baseline  length 
for  each  observing  session  in  Figure  6.  The  slope  of  a  linear 
least  squares  regression  through  these  results  (from  3.0  years  of 
data)  is  1.6  ±  0.5  cm/yr  (the  uncertainty  is  the  statistical  estimate 
of  standard  deviation,  scaled  by  the  NRMS  scatter  of  the  baseline-length 
estimates)  . 

Estimates  of  some  of  these  baseline  lengths  have  been  published 
in  the  past;  we  now  compare  our  estimates  with  those  previously 
published  values.  In  1981,  the  first  estimates  of  the  Haystack-to-Onsala 
baseline  were  published  (Herring  et  al. .  1981).  At  the  time  we 
felt  that  this  estimate  of  the  baseline  length  was  uncertain  at 
only  the  subdecimeter  level.  However,  that  value  (after  compensatioif 
for  the  relativistic  correction;  see  Table  1)  is  21  cm  longer  than 
the  value  given  here;  more  than  twofold  worse  than  we  had  expected. 
We  now  believe  that  the  estimate  obtained  from  the  single- frequency-band 
Mark  I  data  was  positively  biased  by  an  amount  greater  than  expected 
because  of  the  neglect  of  the  ionospheric  delay. 

More  recently  published  values  (Robertson  and  Carter,  1982) 
for  the  Haystack-to-Onsala  baseline  length,  estimated  from  a  subset 
of  the  data  used  here,  have  also  been  positively  biased  by  10  cm. 


In  these  solutions  some  source  positions  were  held  fixed  at  the 
values  obtained  from  the  analysis  of  Mark  I  data  (Robertson  and 
Carter,  1982).  The  value  given  by  Rogers  et  al . .  1983,  agree  within 
2  cm  with  the  value  given  here.  As  we  have  seen,  differences  of 
2  cm  can  be  accounted  for  because  of  slightly  different  solution 
techniques.  (These  results  were  also  obtained  from  a  subset  of 
the  data  analyzed  in  this  paper.) 


We  have  presented  results  which  show  intercontinental  distanc# 
measurement  using  the  Mark  III  VLBI  system  have  a  repeatability  of 
“2.6  cm  for  40  observing  sessions.  We  have  also  attempted  to  study 
the  accuracy  of  these  results  by  carrying  out  a  number  of  repeatability 
tests.  These  tests  have  shown  that  the  estimates  of  the  baseline 
lengths  can  change  by  several  centimeters  depending  on  the  data 
set  and  the  atmospheric  calibrations  used  in  the  analysis.  (For 
the  Westford-to-Onsala  baseline,  the  changes  in  the  baseline  lengths 
of  up  to  2  cm  occurred.) 

The  estimated  rate  of  change  of  the  Westford  to  Onsala  baseline 
is  1.6  ±  0.5  cm/yr.  The  standard  deviation  given  is  the  statistical 
estimate  of  the  rate  from  a  weighted  least  squares  regression 
through  the  baseline  length  estimates,  scaled  by  the  NRMS  scatter 
of  the  these  values  about  their  mean.  Although  the  rate  of  change 
of  the  baseline  length  seems  to  be  significant,  we  do  not  yet  clainw 
to  have  detected  motion  between  these  two  sites  because  the  total 
change  in  the  baseline  length  over  the  three  year  period  is  only 
4.8  cm.  There  is  a  possibility  that  this  apparent  change  in  the 
length  could  be  due  to  changes  in  the  systematic  errors  which  affect 
our  results.  However,  these  errors  are  bounded  at  about  the  5  cm 
level  and  a  continued  lengthening  of  the  baseline  will  be  difficult 
to  dismiss  as  due  to  systematic  error. 

Our  studies  of  the  dependencies  of  the  baseline  length  estimates 
on  elevation  angle  cutoff  and  on  the  statistics  of  the  postfit 


-239- 


residuals  have  indicated  that  the  relationship  between  elevation 
angle  and  air  mass  may  be  systematically  biased.  This  subject 
needs  further  study  because  our  results  also  show  that  the 
repeatability  of  the  baseline  lengths  is  better  when  low  elevation 
angle  observations  are  included  in  the  analysis.  However,  this 
improvement  in  the  repeatability  does  not  necessarily  imply  an 
improvement  in  accuracy. 

A  problem  which  we  need  to  solve  is  the  assessment  of  "realistic" 
standard  deviations  of  the  baseline-length  estimates.  In  nearly 
all  cases  the  WRMS  scatter  of  the  baseline  length  estimates  about 
theii  values  obtained  from  the  ensemble  of  data  are  1.5  to  2.* 
times  the  amount  expected  from  the  standard  deviations.  There  are 
two  reasons  for  this  underestimation  of  the  standard  deviations. 
Firstly,  unmodeled  or  inadequately  modeled  effects  such  as  water 
vapor  delay,  frequency  standard  variations,  radio  source  structure, 
and  ocean  loading  will  contribute  to  the  variability  of  the  baseline 
length  estimates.  These  variations  are  not  totally  accounted  for 
in  the  least  squares  estimates  of  the  uncertainties  (to  some  extent 
they  are  accounted  for  through  the  reweighting  of  the  observation 
uncertainties) .  In  a  future  paper  we  will  discuss  methods  which 
can  be  used  to  assess  these  contributions  to  the  uncertainties  to 
the  baseline  length  estimates.  Secondly,  there  is  a  correlation 
between  the  pseudo-noise  contributions  (due  to  the  unmodeled  site 
dependent  errors)  to  the  observations  involving  a  given  site  at 
each  epoch.  (There  are  also  possibly  temporal  correlations  between 
the  model  errors.)  We  currently  do  not  account  for  these  correlations 
in  the  least  squ-res  solutions  which  will  cause  the  parameter 
estimate  uncertainties  to  be  overly  optimistic.  We  are  currently 
developing  software  which  will  calculate  the  expected  correlations 
between  observations  based  on  the  variances  added  to  the  SNR 
variances. 

The  VLBI  data  used  in  the  analysis  for  this  paper,  and  additional 
data  obtained  in  more  recent  observing  sessions,  can  be  used  for 
many  purposes.  In  future  papers  we  will  discuss  the  performance 


of  the  Mark  III  VLBI  system,  the  positions  of  the  pole  with  respect 
to  the  crust  of  the  earth  and  in  inertial  space  (or  strictly  the 
quasar  reference  frame)  which  were  estimated  in  these  solutions, 
techniques  to  eliminate  the  polynomial  modeling  of  the  relative 
behavior  of  the  hydrogen  masers  (by  use  of  Kalman  filters)  ,  and 
the  precision  and  accuracy  of  the  calibration  of  water-vapor  delay 
with  water  vapor  radiometry. 
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Table  1  Suirmary  of  baseline- length  estimates 


Baseline 

#  Of 

exps. 

length+  sigma+ 

(m)  (m) 

WRMS+ 

(m) 

NFMS+ 

WRMS 

(m) 

*  NFMS* 

Relativity 

correction 

HA-CH 

7 

5,072,314.364  ±  0.006 

0.029 

2.16 

0.028 

2.28 

-0.069 

HA-HR 

30 

3,135,640.956  ±  0.004 

0.017 

1.25 

0.017 

1.28 

-0.045 

HA-CW 

22 

3,928,881.556  ±  0.003 

0.015 

1.50 

0.015 

1.54 

-0.054 

HA-ON 

25 

5,599,714.373  ±  0.005 

0.026 

1.60 

0.023 

1.82 

-0.077 

HA-WE 

6 

1239.395  ±  0.002 

0.004 

1.14 

0.004 

1.14 

-0.000 

HA-EF 

6 

5,591,903.441  ±  0.006 

0.024 

1.53 

0.026 

1.74 

-0.077 

HA-NR 

3 

845,129.839  ±  0.002 

0.010 

2.31 

0.009 

2.31 

£ 

-0.012 

CH-HR 

7 

7,663,737.269  ±  0.010 

0.060 

1.88 

0.063 

1.98 

-0.105 

CH-OV 

7 

7,846,991.133  ±  0.009 

0.047 

1.81 

0.050 

1.92 

-0.106 

CH-ON 

7 

1,109,864.304  ±  0.003 

0.004 

0.46 

0.008 

1.00 

-0.016 

CH-WE 

0 

5,073,265.270  ±  0.007 

-0.069 

CH-EF 

0 

589,796.542  ±  0.004 

-0.008 

CH-NR 

0 

5,834,122.784  +  0.008 

-0.078 

HR-CV 

26 

1,508,195.356  ±  0.002 

0.012 

1.41 

0.012 

1.41 

-0.023 

HR-ON 

37 

7,940,732.104  ±  0.009 

0.063 

1.49 

0.054 

1.38 

-0.111 

HR-WE 

120 

3,134,927.971  +  0.003 

0.021 

1.47 

0.021 

1.51 

-0.045 

HR-EF 

5 

8,084,184.751  ±  0.011 

0.064 

1.54 

0.043 

1.05 

-0.113 

HR-NR 

4 

2,354,633.977  ±  0.003 

0.010 

1.52 

0.009 

1.25 

-0.033 

OV-ON 

26 

7,914,130.853  ±  0.008 

0.039 

1.41 

0.035 

1.44 

-0.110 

CV-WE 

28 

3,928,579.286  ±  0.004 

0.017 

1.63 

0.015 

1.51 

-0.055 

OV-EF 

6 

8,203,742.365  ±  0.010 

0.049 

1.99 

0.048 

2.00 

-0.114  * 

OV-NR 

4 

3,324,244.142  ±  0.004 

0.009 

1.32 

0.009 

1.35 

-0.046 

GN-WE 

40 

5,600,741.326  ±  0.006 

0.026 

1.43 

0.023 

1.58 

-0.078 

ON-EF 

5 

832,210.49 9  ±  0.003 

0.008 

1.28 

0.007 

1.18 

-0.013 

ON-NR 

2 

6,319,317.417  ±  0.008 

0.027 

1.19 

0.018 

0.88 

-0.087 

WE-EF 

0 

5,592,850.979  ±  0.007 

-0.077 

WE-NR 

4 

844,148.075  ±  0.002 

0.003 

0.70 

0.003 

0.71 

-0.012 

EF-NR 

1 

6,334,648.392  ±  0.009 

0.067 

2.74 

0.069 

2.82 

-0.087 
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Table  1  Continued. 

+  10°  elevation  angle  cutoff  solution 
*0°  elevation  angle  cutoff  solution 

The  codes  for  the  stations  are:  HA  Haystack,  Massachusetts;  WE 
Westford,  Massachusetts;  ON  Onsala,  Sweden;  CH  Chilbolton,  England; 
EP  Effelsberg,  Federal  Republic  of  Germany;  NR  National  Radio 
Astronomy  Observatory,  Virginia;  HR  Harvard  Radio  Astronomy  Telescope, 
Texas;  OV  Owens  Valley  Radio  Observatory,  California.  The 
repeatability  of  the  estimates  of  baselines  involving  Westford  have 
been  calculated  including  observing  sessions  which  used  the  Haystack 
observatory,  by  inferring  the  lengths  to  Westford  from  the  lengths 
to  Haystack.  The  weighted  RMS  scatter  of  the  postfit  residuals 
from  the  10°  and  the  0°  cutoff  solutions  were  0.0  90  nsec  (for  15,615 
observations)  and  0.091  nsec  (for  15928  observations) ,  respectively. 
The  speed  of  light  used  to  convert  measured  delays  to  distances 
was  2.99792458xl08  m/sec. 


Figure  1.  Differences  in  the  baseline  length  estimates  between 
solutions  with  different  elevation  angle  cutoffs.  The  error  bars 
represent  the  standard  deviation  of  the  difference  between  the 
estimates  due  to  the  additional  data  included  in  the  lower  elevation 
cutoff  solutions,  a)  0°  elevation  angle  cutoff  solution  minus  the 
10°  elevation  angle  cutoff  solution;  b)  10°  elevation  angle  cutoff 
solution  minus  the  15°  elevation  angle  cutoff  solution;  c)  10° 
elevation  angle  cutoff  solution  minus  the  20°  elevation  angle  cutoff 
solution  t 

Figure  2.  The  accumulated  statistics  of  the  postfit  residuals  in 
air  mass  bins  between  1  air  mass  and  10  air  masses  for  the  0° 
cutoff  and  the  10°  cutoff  solutions.  The  normalized  RMS  (NRMS) 
scatter  is  the  square  root  of  the  chi-squared  per  degree  of  freedom 
for  the  postfit  residuals.  The  95%  confidence  intervals  for  each 
air  mass  bin  give  the  region  in  which  the  NRMS  scatter  should  lie 
(with  95%  probability)  if  the  scatter  of  the  postfit  residuals  were 
in  accord  with  their  statistics. 

Figure  3.  Differences  of  the  baseline-length  estimates  between 
the  10°  elevation  angle  cutoff  solution  and  the  0°  elevation  angle 
cutoff  solution  with  a  correction  to  the  atmospheric  mapping  function 
estimated.  % 

Figure  4.  Differences  of  the  baseline-length  estimates  between 
the  10°  elevation  angle  cutoff  solutions  with  and  without  a  surface 
weather  model  calibration  of  the  neutral  atmospheric  delay.  In 
each  solution  we  estimated  corrections  to  the  zenith  delay  at  each 
site. 

Figure  5.  Estimates  of  the  Westf ord-to-Onsala  baseline  length  from 
the  10°  elevation  angle  cutoff  solution.  The  station  positions 


re  estimated  separately  for  each  observing  session  with  the  source 
sitions  constrained  to  the  values  obtained  from  the  global  solution 
sed  on  all  of  the  data.  The  uncertainty  of  the  slope  has  been 
aled  by  the  NRMS  scatter  of  the  individual  length  estimates. 


elevation  cut-off  solution 
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Abstract:  VLBI  observations  made  in  September  and  October  1980 
with  up  to  five  telescopes  distributed  over  northern  Europe 
and  the  United  States  yielded  values  for  solid  earth  tide 
parameters ; 

h  »  0.62  *  0.01  (radial  Love  number) 
i  »  0.11  t  0.03  (horizontal  Love  number) 

♦  •  1?  si?  (tidal  lag  angle) 

Keywords;  Love  numbers,  tidal  lag,  radio  interferometry 
1.  Introduction 

Very  Long  Baseline  Interferometry  (VLBI)  has  been  used  since  1969 
to  determine  vector  separations  between  radio  telescopes  and  posi¬ 
tions  of  radio  sources.  Initially,  the  precision  of  baseline  length 
determinations  was  about  1  m  (HINTEREGGER  et  al.,  1972).  Since 
then  the  precision  of  baseline  length  determinations  has  improved 
considerably  and  is  now  under  3  cm  for  baselines  from  1000  km  to 
9000  km  (ROBERTSON  et  al.,  1979)  and  subdecimeter  for  intercontin- 
tal  lengths  of  about  6,000  km  (HERRING  et  al.,  1981).  Observations 
made  with  the  new  Mark  Ill  VLBI  system  should  allow  baseline 
lengths  to  be  determined  with  a  precision  of  approximately  1  cm. 

As  the  precision  of  observations  increases,  we  must  ensure  that  the 
models  are  also  improved  for  the  interpretations  of  the  observa¬ 
tions  . 

Solid  earth  tides  have  long  been  incorporated  in  the  model  used 
to  analyze-  VLBI  observations,  and  preliminary  results  have  already 
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been  obtained  for  the  radial  Love  number,  h  (ROBERTSON,  1975). 

Simul caneous  analyses  of  six  (6)  years  of  VLBI  data,  obtained  with 
the  Mark  1  system  (RYAN  et  al.,  1981)  yielded  h  -  0.620±0.005.  The 
standard  deviation  of  the  radial  Love  number  is  based  solely  on  the 
scatter  of  the  postfit  residuals;  the  true  standard  error  is  likely 
to  be  about  a  factor  of  three  larger,  partly  owing  to  the  scatter 
of  results  from  disjoint  data  sets.  In  this  paper  we  present  our 
determinations  of  radial  and  horizontal  Love  numbers  and  the  tidal 
lag  angle,  from  analyses  of  two  weeks  of  VLBI  data  obtained  during 
September  and  October  1980  with  the  Mark  III  system. 

2.  Data  analyses 

The  analyzed  data  consisted  of  interferometric  group  delays 
(SHAPIRO,  1976)  which  were  obtained  during  two  one-week  sessions, 
separated  by  two  weeks,  in  September  and  October,  1980.  Up  to  five 
antennas  were  involved  in  each  session.  Radio  telescopes  in  West- 
ford,  Massachusetts,  Ft.  Davis,  Texas,  Big  Pine,  California,  and 
Onsaia,  Sweden  participated  in  all  sessions.  In  the  first  two 
days  of  the  first  week,  the  radio  telescope  at  Bonn,  FRG,  partici¬ 
pated,  and  the  radio  telescope  at  Chilbolton,  England,  participated 
for  all  of  the  second  weeR.  All  observations  were  made  with  the 
dual -band  Mark  III  system  with  center  radio  frequencies  of  about 
8.3  GHz,  and  2.3  CHz  for  the  two-frequency  bands.  A  multichannel 
bandwidth  synthesis  technique  (WHITNEY  et  al.,  1976)  was  used,  with 
each  recorded  channel  having  a  bandwidth  of  U  MHz  (upper  &  lower  side 
bands);  synthesized  bandwidths  were  300  MHz  at  8.3  GHz;  75  MHz  at  2.3  GHz. 

The  theoretical  models  and  the  coordinate  system  definitions 
used  for  the  analyses  are  summarized  in  HERRING  et  al.  (1981).  The 
dual-band  observations  were  used  to  calibrate  the  ionospheric 
delays  and  surface  weather  data  were  used  to  estimate  the  tropo¬ 
spheric  delays.  Appropriate  parameters  for  each  day  were  included 
in  the  theoretical  model  to  represent  relative  clock  behavior  and 
possible  residual  errors  in  the  tropospheric  path  delay. 

Because  this  paper  deals  with  the  determination  of  the  Love 
numbers  for  the  solid  Earth,  only  the  tidal  models  will  be  described 
in  detail.  For  convenience  in  calculation,  the  tidal  disturbing 
potential  is  modified  and  written  as 
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where  r  is  the  geocentric  position  vector  of  the  disturbing  body, 
rotated  about  the  Earth’s  rotation  axis  by  the  tidal  lag  angle  f , 
a  is  the  geocentric  position  vector  of  the  radio  telescope 
and  GM  is  the  product  of  the  gravitational  constant  and  the  mass  of 
the  disturbing  body. 

Using  Eq.  1,  the  disturbing  potential  and  its  derivatives  with 
respect  to  latitude  and  longitude,  3V/36  and  3V/3X,  respectively, 
are  calculated  for  each  observation  at  each  telescope  fcom  ephemeris 
positions  for  the  sun  and  moon.  These  values  are  then  converted 


to  displacements 

using 

uf  «  h 

(V)g'1 

ue  -  l 

(3V/38  Ig'1 

u*  '  1 

(3V/3X)  (geos  6)"1 

where  h  and  l  are  the  radial  and  horizontal  Love  numbers,  respec¬ 
tively,  g  is  the  normal  acceleration  of  gravity  at  the  site,  and 
ur,  u0 ,  and  u^  are,  respectively,  the  radial,  north,  and  east  dis¬ 
placements  of  the  telescopes. 

The  displacements  are  represented  very  accurately  using  Eq.  2, 
since  the  disturbing  body  positions  are  available  from  other  compu¬ 
tations  in  the  VLBI  data  analyses.  The  disadvantage  in  using  Eq.  2 
is  the  required  use  of  Love  numbers  and  lag  angle  that  are  indepen¬ 
dent  of  the  harmonic  content  of  the  forcing  function.  The  parameters 
h,  1,  and  $  and  their  covariances  are  estimated  simultaneously 
with  the  other  relevant  parameters  form  the  VLBI  data  using  stan¬ 
dard  weighted  least  squares. 

3.  Results 

Figures  1,  2  and  3  present  our  estimates  of  the  tidal  lag  angle 
and  the  radial  and  horizontal  Love  numbers.  Two  features  are 
apparent:  1)  the  scatter  of  each  of  the  results  about  its  weighted 
mean  is  up  to  10  times  the  formal  standard  deviations  of  a  single 
estimate,  and  2)  the  distribution  of  the  individual  results  about 
their  weighted  mean  seems  to  be  systematic  in  each  case.  These 
systematic  trends  might  arise  at  least  in  part  from  neglected 
perturbations  on  the  earth  tides  due  to  ocean-loading  effects. 

The  variation  in  the  formal  standard  deviations  of  the  estimates 
of  the  tidal  parameters  is  approximately  in  accordance  with  that 
of  the  number  of  usable  observations  obtained  each  day.  These 
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numbers  ranged  between  236  for  October  1,  1980  with  four  stations 
participating  to  777  for  October  22,  1980  with  five  stations  par¬ 
ticipating.  The  small  numbers  of  observations  obtained  on  October 
1  and  2  were  due  to  a  telescope  pointing  problem,  subsequently 
fixed  ,  at  the  antenna  in  Westford,  Massachusetts.  Even  accounting 
for  the  variations  in  formal  standard  deviations,  the  scatter  in 
the  results  from  the  first  week  are  greater  than  that  from  the 
second  week.  There  are  several  possible  explanations  for  this 
difference:  1)  During  the  first  two  days  of  the  first  week,  the 
antenna  in  Bonn,  FRG  participated.  If  the  Love  numbers  are  a 
function  of  the  station  locations,  then  the  average  of  the  estimates 
from  these  two  days  could  be  expected  to  differ  from  that  for  the 
remainder  of  the  week;  and  2)  For  two  days  of  the  first  week,  all 
observations  west  of  the  meridian  were  lost  at  the  Westford  antenna. 
This  bias  in  the  distribution  of  the  observations  on  the  sky  could 
increase  the  effects  of  systematic  errors  on  the  results. 


Weighted 
Mean  0.62*0.01 


Figure  1.  Independent  ententes  of  the  redial  Love  number,  h. 

The  error  bars  represent  plus  and  ninua  one  atandard 
deviation,  determined  by  the  addition  of  a  constant  to  the 
variance  for  each  measurement  such  that  >*  per  degree  of 
freedom  was  unity  from  each  weighted  least  squares  solution. 
The  quoted  uncertainty  for  the  weighted  mean  was  calculated  by 
multiplying  its  theoretical  standard  deviation  by  the  ratio  of 
the  scatter  of  the  individual  results  about  their  weighted 
mean  to  the  espected  scatter  due  to  random  error. 


-257- 


-  209  - 

Figure  1  shows  the  weighted  a. can  of  the  results  from  this  analysis: 
h  -  0.62  i  0.01 

l  -  0.11  ±  0.03  (3) 

♦  -  1°  11° 

where  the  quoted  uncertainties  are  based  on  the  scatter  of  the 
individual  results  about  the  weighted  mean. 

The  determination  of  h  is  consistent  with  the  values  obtained 
previously  from  other  VLBI  data  (ROBERTSON,  1975  and  RYAN  et  al., 
1981)  . 

Love  numbers  and  lag  angles  for  each  telescope  location  were 
also  estimated.  The  largest  fractional  difference  between  such  an 
estimate  for  h  and  the  corresponding  global  value  occurred  for 
Big  Pine,  California,  which  has  a  value  of  h“  0.73  l  0.03.  In  view 
of  the  systematic,  time-dependent  behavior  and  the  scatter  of  our 
various  results,  we  do  not  consider  this  difference  to  be  signifi¬ 
cant  . 

The  values  of  the  Love  numbers  obtained  in  this  study  also  agree 
well  with  values  determined  previously  using  other  techniques. 
Theoretical  values  of  the  Love  numbers  typically  range  from  0.60  to 
0.62  for  h,  0.08  to  0.09  for  i  (MELCHIOR,  1978),  with  the  variation 
resulting  from  the  choice  of  earth  model  and  core  rigidity. 
Experimental  determination  of  the  Love  numbers  using  gravimetric, 
tilt  and  extensometr ic  observations  generally  give  combinations  of 
the  Love  numbers  (h,  1  and  k)  which  agree  with  the  theoretical 
estimates  although  the  results  can  be  significantly  perturbed  by 
ocean  tides  and  local  geology.  The  tidal  lag  angle  is  also  obtained 
from  these  techniques.  Estimates  of  its  value  range  between  t5°, 
depending  on  tidal  component,  with  quoted  uncertainties  that  range 
from  0°.l  to  3°  with  the  larger  lag  angles  associated  with  the 
larger  uncertainties  (MELCHIOR,  1978). 

Analyses  of  orbits  of  earth  satellites  also  yield  estimates  of 
the  lag  angle.  Recently,  CAZENAVE  &  DAILLET  (1981)  obtained  lag 
angles  of  0°.32±0°.28  for  the  M2  earth  tide.  They  also  obtained 
1°.  41±0°.28  and  0°.76±0°.29,  depending  on  the  choice  of  Mj  and  0^ 
ocean  tide  models. 
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6.  CONCLUSIONS  AND  DISCUSSION 


The  Mark  III  VLBI  system  is  potentially  capable  of 
measuring  intercontinental  distances  with  one  centimeter 
precision.  In  this  thesis,  we  have  tried  to  determine  the 
proximity  of  the  Mark  III  system  to  this  goal. 

The  analysis  of  the  algorithms  used  to  determine  the 
group  delay,  the  phase-delay  rate,  and  the  visibility  phase, 
from  the  signals  recorded  at  the  radio  telescopes,  did  not 
reveal  any  large  errors  in  these  algorithms.  However,  when  a 
series  of  tests  was  performed  on  the  VLBI  measurements,  we 
found  that  the  apparent  noise  in  these  measurements  did  not 
have  the  statistical  properties  which  were  calculated  from  the 
signal-to-noise-ratio  (SNR)  of  the  observations.  There  were 
clearly  instrumental  deficiencies. 

Analysis  of  the  phase  versus  frequency  response  of  the 
Mark  III  showed  that  the  system  has  large  dispersions.  The 
origin  of  the  dispersion  has  not  yet  been  determined,  but  it 
is  hoped  that  the  properties  of  the  dispersion  given  in  this 
thesis  will  aid  in  isolating  the  origin.  The  main  properties 
of  the  dispersion  are  (i)  it  (the  dispersion)  varies  rapidly 
with  frequency  (variations  of  up  to  40°  have  been  observed 
over  a  4  MHz  change  in  frequency,  see  Figure  3.1.6.);  (ii)  it 
can  be  very  large;  (iii)  it  varies  in  amplitude  smoothly  on 
time  scales  of  months,  and  occasionally  shows  discontinuities, 
see  Figure  3.1.10.  One  possible  cause  for  the  dispersion 


which  seems  to  match  much  of  the  available  data  is  a 


corruption  of  the  phase  calibration  pulses.  In  particular, 
the  possibility  that  extra  pulses  from  the  pulse  generator, 
which  should  be  removed  (or  gated)  from  the  phase  calibration 
signals,  are  being  injected  into  the  receiver  seems  to  require 
further  investigation.  There  are  also  other  possible  causes. 
Among  these,  "ripple"  in  the  video  convertors,  i.e.,  non-lin¬ 
ear  variations  of  phase  with  frequency  across  the  video  con¬ 
verter  bandwidth,  could  explain  the  discontinuity  which  was 
seen  in  the  Westford-Ft.  Davis  interferometer  residual  phases. 

Other  quality  checks  indicated  that  not  all  of  the 
apparent  errors  in  the  group  delays  can  be  explained  by  phase 
calibration  system  errors  or  video  converter  ripple.  The 
group  delay  closure  errors  did  not  match  the  SNR  statistics. 
These  closure  errors  seem  to  be  explained  by  the  leakage  of 
lef t-circularly  polarized  radiation  into  the  output  of  the 
nominally  r ight-circularly  polarized  feed  horns. 

Probably  the  most  powerful  test  we  applied  to  the  Mark 
III  group-delay  measurements  was  the  investigation  of  the 
prediction  of  phase-delay  ambiguities  from  the  group-delay 
measurements.  These  studies  revealed  that  the  group-delay 
measurements  seem  to  drift  relative  to  the  phase  delays.  The 
application  of  the  phase-delay  ambiguity  prediction  techniques 
to  observations  from  a  geodetic  experiment  yielded  results 
which  indicated  that  the  systematic  variations  of  the  group 
delays  relative  to  the  phase  delays,  was  mostly  independent  of 
the  radio  source  being  observed  and  the  orientation  of  the 
radio  telescope.  These  results  indicate  that  the  group  delay 
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errors  probably  arise  from  the  Mark  III  equipment,  rather  than 
from  external  sources,  such  as  antenna  deformation  and  source 
structure.  These  latter  two  effects  do  affect  the  apparent 
group  delay  errors,  but  they  to  not  appear  to  be  the  major 
source  of  these  errors. 

A  compilation  of  the  statistics  of  the  implied  group 
delay  errors  i.e.,  the  differences  between  the  observed  values 
of  the  group  delays  and  the  estimates  of  these  values  calcu¬ 
lated  from  the  phase  delays,  indicate  that  the  standard  devia¬ 
tions  of  the  group  delays,  computed  from  the  assumption  of 
Gaussianly  distributed  random  noise,  are  deficient  in  two 
respects.  Firstly,  there  appears  to  be  a  proportional  error 
in  the  computed  standard  deviations  of  between  1.1  and  1.2, 
which  is  probably  due  to  correlator  non-reproducibility  and 
the  dispersion  in  the  system.  Secondly,  there  appears  to  be  a 
threshold  performance  limit  which  cannot  be  penetrated,  even 
with  very  high  SNR  observations.  At  X-band  (=8.34  GHz) ,  this 
limit  was  0.040  nsec  (*1.2  cm)  and  0.015  nsec  (“0.5  cm),  for 
the  two  experiments  analyzed  (see  Figure  4.3.8).  The  S-band 
(*2.3  GHz)  system  has  a  performance  limit  of  0.15  nsec  (=4.5 
cm)  for  both  of  the  experiments  analyzed.  The  difference  in 
the  performance  of  the  X-  and  S-band  systems  can  be  partially, 
but  not  totally  (see  Table  3.1.1),  explained  by  the  difference 
in  the  spanned  bandwidths  in  these  two  frequency  bands. 
(Typically  300  to  350  MHz  is  spanned  at  X-band  while  only 
75-80  MHz  is  spanned  at  S-band.) 

We  also  presented  results  which  show  intercontinental 
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distance  measurements  with  a  repeatablity  of  2.0  cm,  from  40 
experiments  spanning  3  years,  after  a  "best-fit"  linear  trend 
was  removed  from  the  data.  Studies  of  the  solutions  which 
generated  these  results  show  that  the  accuracy  of  the  length 
estimate,  from  the  ensemble  of  experiments,  is  at  the  several 
centimeter  level  rather  than  at  the  sub-centimeter  level  which 
is  implied  by  the  repeatability  and  the  number  of  experiments 
in  the  ensemble. 

Studies  of  the  the  dependencies  of  the  baseline  length 
estimates  on  the  minimum  elevation  angle  of  observations 
included  in  the  data  set,  have  indicated  that  the  relationship 
we  used  between  the  elevation  angle  of  the  observation  and  the 
airmass  through  which  the  signal  propagated,  may  be 
systematically  biased. 

We  need  to  determine  "realistic"  standard  deviations  of 
the  baseline  length  estimates.  In  nearly  all  cases  the 
weighted-root-mean-square  (WRMS)  scatter  of  the  baseline 
length  estimates  about  the  global  solution  values  were  1.5  to 
2.0  times  the  amount  expected  from  their  standard  deviations. 
There  are  two  reasons  for  this  under  estimation  of  the 
standard  deviations  of  the  baseline  length  estimates. 

Firstly,  unmodelled  or  inadequately  modelled  effects  such  as 
water  vapor  delay,  radio  source  structure,  and  ocean  loading 
will  contribute  to  the  variability  of  the  baseline  length 
estimates.  These  variations  are  not  totally  accounted  for  in 
the  least  squares  estimates  of  the  standard  deviations  (to 
some  extent  they  are  accounted  for  through  the  reweighting  of 


the  observations) .  Secondly,  there  are  probably  correlations 
between  the  pseudo-noise  contributions  to  the  observations 
from  site  dependent  (unmodelled)  systematic  errors.  He 
currently  do  not  account  for  these  correlations  in  the  least- 
squares  solutions,  which  will  cause  the  uncertainities  of  the 
parameter  estimates  to  be  too  small. 

VLB I  data  acquisition  and  analysis  has  progressed 
considerably  in  the  past  14  years.  The  precision  of  baseline 
length  determinations  has  decreased  from  the  meter  level  to  a 
few  centimeters.  The  factors  that  have  limited  the  precision 
of  the  baseline  length  estimates  has  also  changed  considerably 
over  this  period  of  time.  Initially,  the  equipment  was  the 
major  limitation  (Whitney,  1974)  .  Before  the  introduction  of 
the  Mark  III  dual-frequency  band  system,  the  ionospheric  delay 
was  probably  the  major  limiting  error  source  (Robertson, 

1975)  .  Currently,  it  is  not  clear  which  deficencies  in  the 
data  analysis  or  the  equipment  are  the  major  error  sources. 

To  considerably  improve  upon  the  current  precision  will  prob¬ 
ably  require  the  solution  of  a  number  of  distinct  problems. 

The  delay  due  to  atmospheric  water  vapor  is  likely  to  be  the 
most  serious  error  source  at  the  moment.  Recent  improvements 
in  the  water  vapor  radiometer  data  acquisition  techniques  may 
allow  this  delay  to  be  accurately  determined.  However,  there 
are  other  sources  of  errors  which  are  probably  important,  and 
also  need  to  be  studied.  These  error  sources  include 
frequency  standard  instabilities,  instrumental  errors,  radio 
source  brightness  distribution,  oceaning  loading,  the  dry 


atmospheric  delay  and  antenna  deformations. 

In  14  years,  the  precision  of  VLBI  baseline  length  deter 
minations  has  improved  by  almost  two  orders  of  magnitude. 

With  further  study  and  development,  we  should  be  able  to 
improve  the  precision  by  another  factor  of  three  in  the  next 
14  years. 


The  delay  resolution  function  (Rogers,  1970)  is  one  of 
the  most  important  functions  in  geodetic  VLB1.  Maximization 
of  this  function  provides  a  means  of  estimating  the  group 
delay,  phase  delay  and  phase  delay  rate.  (These  will  be  refer¬ 
red  to  as  the  VLBI  observables.)  As  we  approach  being  able  to 
measure  intercontinental  distances  with  1  cm  precision,  it 
becomes  critical  that  we  fully  understand  the  interaction 
between  these  estimation  algorithms  and  the  VLBI  instrument¬ 
ation  and  environment.  In  this  appendix,  the  relationship 
between  the  usual  method  of  maximization  of  the  delay  resolu¬ 
tion  function  and  a  least-squares  approach  to  the  estimation 
problem  will  be  described.  We  introduce  the  least-squares 
approach  because  it  allows  quantitative  determination  of  the 
effects  of  phase  errors,  dispersions,  and  data  loss  on  the 
VLBI  observables. 

Before  developing  the  equivalence  of  the  maximization  of 
the  delay  resolution  function  and  the  least  squares  estimator, 
we  review  the  concepts  of  group  delay,  phase  delay  and  phase 
delay  rate.  We  then  review  the  development  of  the  delay 
resolution  function  which  is  shown  to  be  a  maximum  likelihood 
estimator,  highlighting  the  assumptions  made  during  the  deri¬ 
vation.  Finally,  before  showing  the  equivalence,  we  consider 
some  of  the  practical  details  of  computing  the  maximum  likeli¬ 
hood  or  least  squares  estimates  of  VLBI  observables. 
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The  maximum  likelihood  estimator  for  the  VLBI  observables 
is  given  by  Rogers  (1970)  and  developed  in  detail  by  Whitney 
(1974).  The  review  given  here  will  follow  closely  the  devel¬ 
opment  of  Whitney  with  some  modifications. 

Two  radio  telescopes  are  assumed  to  be  observing  an 
emitter  of  electromagnetic  energy  at  radio  frequencies.  The 
output  of  noiseless  receivers,  in  noiseless  surroundings,  at 
the  two  telescopes  will  be  given  by  /Ta^  s1(t)  and  /Ta2  s2(t) 
where  Ta^  and  Ta2  are  the  antenna  temperatures  at  each  site 
and  the  dimensionless  functions  s^(t)  and  s2(t)  are  assumed  to 
be  white  noise  with  zero  mean  and  unit  variance.  The  antenna 
temperatures  are  simply  a  convenient  method  of  representing 
the  power  being  received  by  each  antenna  from  the  radio 

source.  In  terms  of  the  source  strength  S0  (given  in  Janskys, 

—  26  2 

where  1  Jansky  *  10  w/m  -Hz)  ,  the  antenna  temperature  for 
observations  in  one  polarization  from  a  broadband,  unpolarized 
source  will  be  (Krauss,  1966) 

Ta  *  Ae  Sc  /  2k  (A.l) 

where  A_  is  the  effective  area  of  the  antenna  and  k  is 
e 

-23 

Boltzmann's  constant  (1.38x10  Joule/K) . 

For  the  purposes  of  this  section  it  is  more  convenient  to 
use  the  spectra  of  s^t)  and  s2(t)  which  will  be  denoted  by 
S^fw)  and  S 2  ( b>)  .  The  vector  symbol  denotes  complex  quanti¬ 
ties.  We  assume  that  real  and  imaginary  components  of  the 
spectra  are  independent  and  Gaussianly  distributed.  While 
these  may  seem  somewhat  arbitrary  assumptions,  the  central 
limit  theorem  of  probability  ensures  that  the  sum  of  many 


small  independent  sources  of  energy  will  indeed  be  close  to 
Gaussianly  distributed,  independent  of  the  original  probabil¬ 
ity  distributions  obeyed  by  the  individual  energy  sources  (see 
Section  2.1).  We  further  assume  that  the  spectral  components 
at  each  sampled  frequency  w  are  independent  of  each  other. 
There  are  two  aspects  to  this  assumption.  Since  we  are  only 
able  to  observe  the  signal  using  a  finite  bandwidth  B,  and  for 
a  finite  duration  T,  the  Nyquist  sampling  theorem  states  that, 
provided  we  sample  the  signal  at  the  Nyquist  rate  (1/2B) , 
there  are  only  BT  independent  complex  frequency  components  of 
the  spectra  which  can  be  determined  from  the  data.  The 
processing  algorithms  used  in  the  correlator  ensure  that  the 
correct  number  of  cross  spectral  components  are  computed 
(although  as  we  will  see  later,  these  components  are  not 
computed  at  the  minimum  frequency  spacing  determined  by  the 
duration  of  the  data) .  The  recording  equipment  is  set  to 
sample  the  data  at  the  correct  rate.  The  second  aspect  of 
this  assumption  is  that  the  original  spectra  must  have 
independent  components  at  the  frequencies  which  we  are  able  to 
sample.  If  this  assumption  is  to  be  valid  then  the  phases  of 
the  spectral  components  of  the  signals  should  vary  randomly 
between  frequencies.  (We  are  considering  the  spectrum  of  the 
signal  not  the  cross  spectrum  of  the  data  recorded  at  each 
site.  The  cross  spectrum  phase  at  each  frequency  will  be 
related  to  the  difference  in  arrival  time  of  the  signals  at 
each  site.)  Since  the  signal  spectrum  is  the  sum  of  the 

4  7 

spectra  of  the  emission  from  many  electrons  (at  least  10 
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electrons;  see  Section  2.1),  the  phase  of  this  spectrum  should 
vary  randomly. 

The  signals  from  which  S^(to)  and  S2(w)  are  determined 
originate  from  a  common  source  and  hence  should  be  related  if 
these  signals  are  spatially  coherent  on  scales  of  several 
thousand  kilometers.  Thus,  we  assume  that  for  each  frequency 
we  can  define  a  wavefront  such  that  the  phase  of  any  spectral 
component  of  the  signal  remains  constant  as  we  move  along  this 
wavefront.  Since  the  properties  of  the  propagation  medium 
between  any  point  on  the  wavefront  and  the  source  may  change 
as  we  move  along  the  wavefront,  the  wavefront  is  not  necess¬ 
arily  plane  even  for  sources  an  infinite  distance  from  the 
radio  telescopes. 

The  observed  spectra  are  /tfa-^S^  (u>)  and  /TiT^Sj  (w)  and  the 
relationship  between  these  spectra  can  be  expressed  as  §2(w)  = 

_  .  i 

S ^  ( <*>)  e  w  where  is  a  phase  change  which  accounts  for  the 
difference  in  arrival  times  of  the  signal  at  the  two  sites  (we 
will  neglect,  for  the  moment,  that  is  a  function  of  time 
because  of  the  rotation  of  the  Earth) .  This  phase  change  in 
the  general  case  is  given  by 

l2  ?! 

♦»  *  1  -  /  V*>-dJ  (A-2> 

0  0 

where  the  integration  is  along  the  ray  path,  ku(r)  is  the  wave 
vector  (see,  e.g.,  Jackson,  1975,  Chapter  7)  which  may  be  a 
function  of  position  and  frequency,  and  and  ?2  are  the 
position  vectors  of  the  receivers  with  respect  to  the  source. 


If  we  assume  R^tr)  is  independent  of  position,  then  Equation 

(A. 2)  reduces  to  $  ■  1c  .Ar  where  Ar  «  r,  -  r,  . 

w  w  2  1 

We  now  consider  a  simple  case  where  the  radio  waves  have 
been  emitted  from  a  point  source,  an  infinite  distance  from 
the  radio  telescopes,  and  have  propagated  through  a  homogene¬ 
ous  nondispersive  medium.  For  this  case  the  wave  vector  ku 
will  be  given  by 


where  Vp  is  the  phase  velocity  of  the  propagation  (independent 
of  frequency  in  this  case,  by  definition)  and  Sg  is  a  unit 
vector  in  the  direction  from  the  receiver  to  the  point  source. 
The  phase  delay,  t  ,  will  be  -eg.Ar/Vp  and  hence  the  phase 
difference,  ♦a),  between  corresponding  spectral  components 
becomes  ♦  =  ut  .  We  could  use  this  formula  directly  to  find 

CO  P 

the  phase  delay  by  comparing  the  spectra  of  the  signals 
received  at  two  sites  and  determining  the  difference  in  phase 
between  the  corresponding  spectral  components.  An  alternative 
(and  equivalent)  approach  would  be  to  multiply  the  spectra 
together  to  form  the  cross  spectrum.  The  phase  of  the  cross 
spectrum  (neglecting  the  noise  contribution)  at  frequency  co 
would  be  However,  we  are  only  able  to  measure  modulo 

2n  radians.  We  have  no  direct  means  of  determining  the  total 
number  of  cycles  through  which  the  phase  has  turned  as  the 
signal  propagated  from  the  source  to  each  of  the  sites.  If  we 
were  to  use  a  priori  information  about  the  magnitude  of  to 
resolve  the  number  of  cycles  or  ambiguities,  the  initial 
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estimate  of  tp  would  need  to  be  sufficiently  accurate  that  we 
could  predict  to  within  much  less  than  *  radians.  For 
observations  made  at  8  GHz,  this  condition  requires  the  error 
in  the  a  priori  value  of  t  to  be  much  less  than  0.06  nsec, 
equivalent  to  =2  cm.  In  general,  this  condition  cannot  be 
met.  Chapter  4  deals  with  some  attempts  to  eliminate  this 
ambiguity. 

To  overcome  the  ambiguity  problems  with  the  phase  delays, 
the  bandwidth  synthesis  technique  was  developed  (Rogers, 

1970) .  The  foundation  of  this  technique  is  that  while  the 
total  number  of  cycles  at  a  given  frequency  may  not  be  known, 
the  number  of  cycles  between  two  relatively  close  frequencies 
may  be  calculated,  with  uncertainty  far  less  than  half  a 
cycle,  from  a  priori  information.  The  delay  is  then  found 
from  A<t>/ Aw*  As  Aw  tends  to  0,  a$/Aw  approaches  a<t>/aw  which  is 
the  group  delay. 

Using  the  group  delay,  we  could  model  the  observed  phase 
with  =  wTg,  but  this  expression  is  inadequate  for  two 
reasons.  If  the  radio  waves  have  propagated  through  different 
dispersive  regions  (e.g.  the  ionosphere)  before  arriving  at 
each  site,  then  the  phase  delay  and  group  delay  will  not  be 
equal  (see  Section  4.1)  and  the  projection  of  the  group  delay 
to  zero  frequency  will  not  pass  through  the  origin  (Figure 
A.l.a),  a  condition  imposed  by  the  above  form  of  $  .  Even 
without  dispersive  effects,  the  uncertainty  in  the  number  of 
cycles  and  the  value  of  the  phase  at  each  frequency  will  lead 
to  a  similar  problem  (Figure  A.l.b) .  To  avoid  these  problems 


Figure  A.l  Graphical  representations  of  group  delay  and  visibility  phase 
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we  model  the  observed  phase  using 

<t>  =  + 

u>  g  o 

where  is  a  phase  offset  referred  to  zero  frequency. 

To  more  readily  see  the  meaning  of  in  the  above 
expression,  we  can  write  an  equivalent  expression  for  the 
observed  phase:  41  =  (oj-oj,.)  x_  +  where  is  the  phase 

CO  O  y  t  L 

referred  to  frequency  uQ.  This  phase  is  called  the  visibilty 
phase.  is  now  related  to  the  phase  delay  by  u)0tp  “  4>t  + 

2-rcN,  where  N  is  the  number  of  integer  cycles  of  phase  needed 
to  achieve  equality. 

The  discussion  so  far  has  assumed  that  the  differences  in 
phase  between  the  corresponding  spectral  components  of  the 
signal  recorded  at  two  sites  are  independent  of  time.  In 
reality,  the  two  sites  are  located  on  a  rotating  earth  and  the 
phase  differences  will  change  with  time.  To  account  for  this 
effect  we  introduce  the  phase  delay  rate  *  .  The  phase  can 
now  be  modeled  as 

❖  ^  =  ( (1)—  (i)Q )  Xg  +  +  u^p(t-tc)  (A. 3) 

where  t  is  the  epoch  of  the  observation  (i.e.  the  time  to 
which  x  and  are  referred)  and  _t  is  the  epoch  of  the  phase 
determination.  In  this  case  we  can  use  the  phase  delay  rate 
directly,  without  resorting  to  a  group  delay  rate,  because  we 
are  only  interested  in  the  change  of  phase  with  time  and  do 
not  need  to  know  the  total  number  of  ambiguities  at  any  time. 

In  summary,  we  started  with  the  spectra  of  signals  at 


each  site  which  were  related  by 
§2(w)  =  S^(w)  e  * 

and  now  we  have  a  relationship  between  6  and  the  VLBI  obser- 

w 

vables  (Equation  A. 3)  for  the  case  of  the  radio  waves  origin¬ 
ating  from  a  point  source  and  propagating  through  a  homogene¬ 
ous  non-dispersive  region.  In  terms  of  the  VLBI  observables 
the  relationship  between  the  spectra  becomes 

§2  (u)  =  («)  expt-i  (w-u>0)  Tg  -  i$fc  -  iuTpAt]  (A. 4) 

where  At= t- t  . 

c 

We  now  investigate  how  to  determine  t  ,  4>.  and  x  from 

g  t  p 

the  signals  recorded  at  each  site,  /Ta^s^t)  and  /¥a^s2(t). 

The  data  recorded  at  each  site  consist  not  only  of 
/Ta^s^ (t)  and  /Ta^s2(t)  but  also  of  additive  noise  due  mostly 
to  the  receivers.  Returning  to  the  frequency  domain,  we  can 
write  the  spectra  of  the  data  recorded  at  each  of  two  sites  as 

Xj.U)  =  /Ta^U)  +  /Ts^N^o) 

X2(uj)  =  *'l^^2®2  (u>)  +  y^S2^2  ^ 

where  /Fs'^N^((jJ)  and  /FsF^N^u)  are  the  noise  spectra  at  each 
site,  and  Ts1  and  Ts2  are  the  "off-source"  system  tempera¬ 
tures.  As  with  the  antenna  temperature,  the  system  tempera¬ 
ture  is  a  convenient  method  of  measuring  the  noise  power.  We 
assume  the  components  of  the  noise  spectra  have  Gaussianly 
distributed,  independent  real  and  imaginary  components.  The 
components  at  each  frequency  are  also  assumed  to  be  indepen¬ 
dent.  These  assumptions  can  be  justified  by  the  same  argu- 
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ments  used  for  the  spectra  from  the  source. 

Before  progressing  further  it  is  convenient  to  change 
notation  to  reduce  the  number  of  symbols  used  in  subsequent 
equations.  We  define 
3  ( to)  2  /Ta^ 

KS'  (w)  2  /Ta^  S2  (w) 

where  K  *  y/Ta2/Ta1.  The  scaling  K  ensures  that  the  components 

of  3(w)  and  3' (w)  have  the  same  variance,  which  is  equal  to 

Ta^/2.  The  noise  spectra  are  similarly  redefined 

NJ  =  /fs^  (oj) 

N2  2  /Ts^  N2(w)  . 

Also  for  convenience  we  will  define 

a  2  (oj-w  )x  +  A  +  wt  At  . 
o  g  t  p 

Using  the  new  symbols  the  expression  for  the  spectra  of  the 
recorded  data  at  each  site  will  be 

X.  (u>)  =  S  (u>)  +  N'  (u) 

(A. 5) 

X2  (w)  =  KS'  (cj)  +  (u) 

and  the  relationship  between  spectra  will  be  S'  (to )  = 

§(w)e-la.  All  of  the  spectra  given  in  Equation  (A. 5)  vary 
with  time.  However,  this  time  dependence  does  not  mean  that 
we  cannot  determine  estimates  of  the  above  spectra.  Any 
finite  duration  or  data  can  always  be  expanded  in  Fourier 
series.  In  a  later  part  of  this  appendix  and  in  Appendix  B  we 
will  discuss  the  determination  of  these  spectra.  (We  will  see 
later,  that  the  spectra  of  the  signals  recorded  at  each  site 
never  need  be  computed,  only  the  cross  spectrum  of  the  signals 
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is  computed.) 

We  now  wish  to  develop  an  estimation  technique  to  deter¬ 
mine  rg,  and  tp  given  the  spectra  X^w)  and  X2(u>)  which  we 
can  determine  at  a  number  of  different  frequencies  and  times. 
For  an  observation  of  recorded  bandwidth  B  and  duration  T,  a 
maximum  of  BT  independent  estimates  of  the  components  of  X^(w) 
and  X2(u)  can  be  calculated,  assuming  that  the  signals  have 
been  sampled  at  the  Nyquist  rate  1/2B.  If  we  observe  for  100 
seconds  with  a  total  recorded  bandwidth  of  16  MHz,  then  the 
number  of  independent  estimates  of  the  components  of  the 

q 

spectra  is  1.6x10  (a  very  large  number). 

There  are  many  estimation  techniques  we  could  use  to  find 

estimates  of  t  ,  and  x  .  Of  these  we  will  develop  the 
g  t  p 

Maximum  Likelihood  (ML)  estimate.  This  technique  requires  a 
priori  knowledge  of  the  joint  probablity  density  function 
(PDF)  of  the  observations  (in  our  case  X1 (w)  and  X2(w))  which 
are  functions  of  the  VLBI  observables.  The  ML  estimates  of 

A 

tg ,  4>t  and  tp,  which  will  be  denoted  by  *g,  and  tp,  are 
estimated  by  maximizing  the  conditional  PDF  p (X^,X2 | Tg ,$t ,*p) 
where  tg,  and  t  are  the  values  to  be  varied  until  the 
conditional  PDF  is  maximized.  The  values  of  xg,  and  tp 
which  maximize  the  conditional  PDF  are,  as  stated,  the  maximum 

A 

likelihood  estimates  xg ,  and  tp. 

At  this  stage  the  conditional  PDF  we  need  is  not  avail¬ 
able.  It  will  have  to  be  derived.  We  commence  with  the  PDF's 
which  we  do  know  (or  strictly,  have  assumed) .  From  the 
recorded  data  we  have  BT  spectral  components.  For  each  of 
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these  components  the  signal  and  noise  values  will  be  denoted 
by  Sjf  N[j  and  (The  index  j  implicitly  implies  that  these 

components  are  functions  of  both  frequency  and  time;  see 
Section  2.2  and  Appendix  B  for  discussion.)  Since  we  have 
assumed  Gaussian  distributions  for  the  real  and  imaginary 
parts  of  the  signal  and  noise  components,  the  individual  PDF's 
will  be 

P(3i  )  -  - — j  exp(  -ISJ2/  2(o  )2) 

3  2,(o,)2  3  3 

*  t~t  ,2  exp(  2<»„i)2) 

2*  °nl  j 

P«S2j>  ■  — ~ ,-2'  exp'  -|H2j|2/  2<»n2>?> 

2,1  °n2  j 

2  2  2 

where  (°s)j»  (°nl)j  and  *°n2*j  are  the  variances  of  each  of 
the  real  and  the  imaginary  components  of  the  respective 
signals  and  noises. 

Since  all  the  signals  and  noises  have  independent  compon¬ 
ents  we  can  form  their  joint  PDF,  p(3,R^,R£),  which  is  given 
by  the  product  of  their  individual  PDF's. 


BT 

p(S,N^,N^)  =  n 


j«l  <2*)3(os)2(onl)2(on2)2 
exp[  -|Sj|2/2(os)2-  |Slj|2/2(onl)2  - 


N 


2j 


I 2/2 (an2) 


Remembering  that  the  observed  spectra  are  and  Xjr  we  can 
substitute  for  and  R^  in  the  above  PDF  using  R^  *  -  S 

and  Si  *  X,  -  KSexp(-iS^)  where  a.  is  defined  to  be  (u-w  )t  + 


it  +  u^pAtj 


Hence  we  obtain 


BT  . 

p(XlfX2,S  I  V*t'xp  )  “.n.  (0  3.  7Tj  7Tt  72  * 

D=1  (2«)  («s)j(^nl)j(<,n2)j 


exp[ 


.  ISjlf  -  l»»-  Sll2  .  I«21  -  KS-^1*!2  J  (i 


2<«s>2 


2 


2<‘n2>j 


The  above  expression  is  not  directly  usable  because  it  still 
depends  on  the  spectrum  5  of  the  signal  which  can  not  be 
directly  observed  (because  of  the  presense  of  the  noise) .  T 
desired  conditional  PDF  can  be  generated  from  Equation  (A. 6) 
by  integrating  over  all  possible  values  of  S  to  form  a  mar¬ 
ginal  PDF  (see  e.g.  Drake,  1967,  pp.  69-72,  for  a  discussion 
on  generating  marginal  PDF's): 


P<V*2I  VVV  mJ  p(VV§  i  vvv  d§ 


The  details  of  carrying  out  this  integration  are  given  by 
Whitney  (1974,  pp.  64-65) .  The  result  is 


BT 


P  (Xi  »X2  |  :'g'$t'Tp*  =  11 

1  =  1 


2 
a . 


3-1  <2’>2'»s>jl'’nl>j<‘’n2»j 


r .  i^ui2 .  i*2jr  ♦  iJii  , 
exp[  — a-_2  ) 


2(’nl>2  2‘«n2>j 


(A. 7) 


2  n. 


where  Oj  = 


'”,»2(''nl»2«'’n2>2 

- - - - - < - - -  - * - *■ - - - y 

(a  .)  .(a  -)  •  +  (c  ) 7(o  0) 7  +  K(a  )  . ( a  ,)  . 

'  nl'i'  n2'i  '  s' 3  n2  3  S  3  nl  3 
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and  is  the  complex  conjugate  of  X^j,  i=l,2. 

Although  Equation  (A. 7)  looks  very  complicated  (and  not 
very  close  to  a  usable  form) ,  its  interpretation  is  quite 
simple.  Remembering  that  X^j  and  X2j  are  the  observed  quant¬ 
ities,  we  can  choose  a  set  of  trial  values  for  x  »  J.  and  x 

9  t  p 

which  would  determine  the  complex  vector  7^  (assuming  all  the 
sigma's  are  known).  We  would  then  form  the  product  over  the 


BT  values  to  determine  p(X1 ,X2 | xg,$fc, *p) .  When  this  operation 
is  performed  for  many  sets  of  trial  values  of  and  t  , 

the  set  giving  the  maximum  value  for  p(XlfX2 I wil1  be 

A 

the  maximum  likelihood  estimates  xg»  and  Xp. 

We  can  save  a  great  deal  of  time  in  computing  the  product 

(A. 7)  by  noting  that  | j | 2/2 { anl ) ]  and  |X2 j | 2 /2 ( on2) 2  are  not 

affected  by  our  choice  of  xg,  and  xp  and  hence  will  not 

_  O 

affect  the  maximization  process.  Also,  by  expanding  |Y^|  we 
obtain 


|  Y  j  | 2  =  KihA  +  ♦  2a^K  ReCx.-X^e-^j  } 


(‘nl>j 


<*n2>5  (onl)5(°n2)j 


in  which  again  the  first  two  terms  are  not  affected  by  the 
choice  of  and  Xp*  To  find  the  maximum  likelihood 

estimate  it  is  thus  sufficient  to  maximize 


i-1  <0s>5<°nl>j<°n2>j 


exp[- 


Re{  X1:.X2je'iaj}] 


<0nl>;<0n2>j 


This  expression  may  be  further  simplified  by  taking  the 
natural  log  of  the  product,  which  yields  a  summation  to  be 


maximized 


BT  BT  aZ.  K 

Eln(  2  2  2  2 

3*1  <‘s>jKl>j<‘n2>j  ^  Kl>j<*n2>j 


BT 


iK 


Re{ 


XljXj je_iaj 


The  first  summation  will  not  affect  the  maximization  and  we 
are  left  with  maximizing 


BT 

Z 

j=l 


2„ 

14 


(anl) j(an2): 


7  Re{  RljX2je"i53> 


(A. 


to  find  the  maximum  likelihood  estimates  of  Xgr  and  Xp* 

At  this  stage,  it  is  normally  assumed  (Whitney,  1974) 
that  the  sigmas  given  in  Equation  (A. 8)  are  independent  of  j 
and  hence  can  be  taken  outside  the  summation  reducing  the 
expression  to  be  maximized  to 
BT  -  -*  -iS 

^  Re{  XJL.X2.e  1Cfj}  (A. 

Equation  (A. 9)  is  very  close  to  the  formula  used  by  the  Mark 
III  correlator  and  processing  software  (FRNGE) ,  to  estimate 
the  VLB I  observables  Xg*  and  Xp.  If  we  examine  Equation 
(A. 9)  more  closely,  we  can  derive  the  actual  formula  used  by 
the  Mark  III  processing  programs.  Remembering  that  lij  * 
(u)j-a>0)Xg  +  ?t  +  WjTpAtj,  we  see  that  Equation  (A. 9)  may  be 
rewritten  as 


Re{  e~i^ 


BT 


fc  jli  XljX2jexpt-i(uj-wo)^g 


-  iwjTpAtj] } 


where  has  been  taken  outside  the  summation  because  it  is 
not  a  function  of  the  cross  spectral  frequencies.  Equation 
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(A.9)  can  be  maximized  by  maximizing  the  magitude  of 


_  _  * 


XljX2j  expt  -  ‘“jW 


(A. 10) 


and  rotating  the  value  of  Equation  (A. 10) ,  evaluated  at  *  and 

A 

tp ,  by  such  that  it  is  then  purely  real  (rotated  back  onto 

the  real  axis) .  Equation  (A. 10)  when  divided  by  BT  is  called 
the  delay  resolution  function.  Use  of  the  delay  resolution 
function  instead  of  Equation  (A. 9)  leads  to  a  considerable 
saving  in  computation  since  it  is  only  necessary  to  search 
over  two  parameters,  t^  and  Tp,  instead  of  the  three  para¬ 
meters  which  would  be  needed  if  Equation  (A. 9)  were  used. 

The  maximization  of  Equation  (A. 8)  represents  the  maximum 
likelihood  estimator  of  the  VLBI  observables  when  the  noise 
statistics  are  assumed  to  be  Gaussian.  Equation  (A. 10)  which 
is  used  by  the  Mark  III  processing  software  makes  a  further 
assumption  that  the  standard  deviations  of  the  Gaussian 
statistics  are  independent  of  frequency  and  time.  The 
equivalence  of  the  ML  estimator  and  a  least  squares  estimator 
will  be  shown  for  Equation  (A. 8).  The  relationship  between 
the  Mark  III  algorithms  and  the  least  squares  estimator  will 
then  be  shown. 


The  input  for  the  delay  resolution  function  is  the  cross 
_  _  * 

spectral  components  The  derivation  of  the  delay 

resolution  function  assumed  that  these  cross  spectral  compon¬ 
ents  were  available  near  the  radio  frequency  (RF)  for  the 
observation.  As  discussed  in  Section  2.1,  the  signals  record¬ 
ed  at  each  site  have  been  heterodyned  from  RF  to  video  fre- 


quencies  so  that  the  signals  can  be  sampled.  The  required 
input  for  the  delay  resolution  function  can  be  easily  gener¬ 
ated  from  the  video  signals  because  the  heterodyning  only 
translated  the  spectrum  in  frequency.  (There  were  local 
oscillator  phases  added  to  the  phases  of  the  spectra  from  each 
site,  but  these  local  oscillator  phases  are  calibrated  by  the 
phase  calibration  system;  see  Section  2.3.)  In  the  discussion 
that  follows  we  will  compute  the  cross  spectrum  from  data 
recorded  at  video  frequencies.  The  frequencies  of  the  cross 
spectrum  will  have  the  local  oscillator  frequencies  added  to 
them  to  produce  the  correct  cross  spectral  frequency. 

Before  proceeding  further,  it  is  wise  to  look  at  some  of 
the  practical  aspects  of  performing  the  summation  in  either 
Equation  (A. 8)  or  (A. 10).  In  most  Mark  III  geodetic  experi¬ 
ments  the  recorded  bandwidth  is  typically  16  MHz  (consisting 
of  eight,  2  MHz  channels)  and  the  observation  of  a  single 

source  is  usually  of  100  seconds  duration.  For  these  values, 
g 

3.2x10  equi-spaced  data  samples  are  recorded  at  each  site 

g 

participating  in  the  observation.  From  3.2x10  real  data 
g 

samples,  1.6x10  complex  components  of  the  cross  spectrum  can 
be  calculated.  The  frequencies  at  which  these  components  are 
evaluated  are  however  not  arbitrary.  The  frequency  spacing 
will  be  the  inverse  of  the  duration  of  the  observation.  The 
highest  frequency  component  which  can  be  estimated  will  be  the 
inverse  of  twice  the  sampling  interval.  In  addition,  since 
the  observations  are  recorded  at  a  number  of  widely  separated 
frequencies  the  cross  spectrum  can  only  be  computed  at  fre- 
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quencies  within  the  bandwidths  which  have  been  sampled. 
Therefore  within  each  2  MHz  band  which  has  been  recorded  we 

g 

can  compute  2x10  complex  components  of  the  cross  spectrum. 

We  restrict  the  following  discussion  to  a  single  such  band. 

A  cross  spectrum  computed  directly  from  the  recorded  data 
may  not  be  very  meaningful  because  of  the  rotation  of  the 
earth.  For  each  frequency  the  phase  of  the  cross  spectrum 
will  be  a  function  of  the  difference  in  arrival  times  of  the 
signals  from  the  radio  source.  As  the  earth  rotates  this 
phase  will  change.  If  the  difference  in  arrival  times  of  the 
signals  were  changing  at  a  constant  rate  then  the  phases  of 
each  frequency  component  would  be  changing  at  a  constant  rate 
which  would  be  equivalent  to  a  change  in  the  frequency  of  each 
component  in  the  cross  spectrum.  Even  this  situation  does  not 
apply  to  geodetic  VLBI  because  the  acceleration  of  the  differ¬ 
ence  in  arrival  times  is  not  negligible.  For  a  trans-conti¬ 
nental  baseline  this  acceleration  term  could  change  the  fre¬ 
quencies  in  the  cross  spectrum  by  up  to  1  Hz/sec  (w0=8  GHz) . 
The  resolution  in  frequency  for  a  100  sec  observation  would  be 
0.01  Hz  and  the  change  in  frequency  introduced  by  the  accel¬ 
eration  would  be  a  severe  problem.  In  order  to  solve  this 
problem  we  compute  the  cross  spectrum  using  shorter  intervals 
of  time.  Hence  we  subdivide  the  100  sec  observation  into  seg¬ 
ments.  From  each  segment  of  data  we  can  obtain  estimates  of 
the  components  of  the  cross  spectrum  and  then  add  the  cross 
spectral  estimates  together  accounting  for  both  the  Doppler 
shift  and  the  change  in  Doppler  shift  for  each  segment  of 


data.  Since  the  duration  has  been  shortened  the  frequency 
spacing  of  the  cross  spectral  components  which  can  be  computed 
will  be  coarser  than  if  we  had  used  the  full  100  sec.  This  is 
not  a  severe  problem  because  the  power  spectra  of  the  signals 
and  noise  are  almost  constant  over  the  2  MHz  recorded  band¬ 
width  (see  Section  2.1)  and  the  expectation  of  the  estimates 
of  the  components  of  the  power  spectra  will  not  change  rapidly 
between  adjacent  frequencies  (the  estimates  themselves  could 
change  rapidly,  however) . 

For  a  typical  geodetic  observation  what  should  be  the 
duration  of  the  segments?  A  cursory  consideration  may  imply 
that  the  segment  length  could  be  almost  one  second.  If  a  one 
second  segment  of  data  were  used  to  compute  estimates  of  the 
cross  spectral  components  the  change  in  frequency  of  a  cross 
spectral  component  would  be  1  Hz  during  the  interval  of  time 
used  to  compute  the  cross  spectrum,  and  the  frequency 
resolution  would  also  be  1  Hz,  i.e.  there  would  be  a 
"smearing”  of  the  cross  spectrum  comparable  to  the  frequency 
resolution.  Unfortunately  this  duration  for  the  segment  is 
far  too  long.  To  see  why  this  is  so,  we  need  to  consider  the 
methods  used  to  compute  the  cross  spectral  components. 

Two  different  methods  could  be  used,  either  the  Fourier 
transform  of  the  cross  correlation  function  of  the  data  from 
two  sites,  or  the  product  of  the  Fourier  transforms  of  the  two 
individual  data  sets.  If  we  try  cross  correlating  the  data 
from  two  sites,  a  problem  develops  immediately  because  the 
delay  between  the  signals  is  changing  with  time,  and  hence,  as 


-285- 


we  cross  correlate  the  data,  the  peak  in  the  cross  correlation 
will  move  and  "smear"  the  peak  out.  The  peak  in  the  cross 
correlation  function  will  translate  at  the  group  delay  rate 
which  can  be  as  high  as  2  nsec/sec.  If  we  try  to  cross  cor¬ 
relate  data  for  too  long  an  interval,  the  data  will  decor¬ 
relate  because  of  this  changing  delay.  The  computation  of  the 
Fourier  transform  of  the  data  from  each  site  does  not,  at 
first,  seem  to  suffer  from  this  problem  because  the  changing 
delay  will  only  shift  the  frequency  of  the  spectrum.  With 
this  approach  the  problem  arises  when  the  product  of  the 
spectra  from  two  sites  is  computed.  Although  the  spectra  are 
computed  at  the  same  frequency,  the  frequencies  of  the  spectra 
are  Doppler  shifted  with  respect  to  each  other.  This  shift  in 
frequency  causes  problems  because  the  phase  of  the  spectral 
components  from  each  station  vary  randomly  with  frequency. 
(Even  if  we  assume  no  noise,  the  random  variation  in  phase 
occurs  because  the  signals  from  the  radio  source  are  them¬ 
selves  random  quantities.  It  is  only  the  phase  of  the  cross 
spectrum  which  varies  smoothly  with  frequency.)  This  random 
variation  of  phase  with  frequency  requires  that  the  proper 
frequencies  of  each  spectrum  must  be  carefully  matched  when 
the  product  of  the  spectra  is  formed  to  ensure  that  the 
correct  cross  spectral  phase  is  computed. 

To  choose  an  appropriate  duration  for  a  segment  of  data, 
we  need  to  consider  how  the  actual  cross  spectrum  changes 
during  the  interval  which  is  used  to  compute  the  estimates  of 
its  components.  The  shorter  the  duration  used  the  fewer  the 


number  of  frequency  components  we  will  be  able  to  determine, 
but  the  better  the  match  between  the  expectation  of  the 
computed  cross  spectrum  for  that  epoch  and  the  actual  cross 
spectrum.  Since  the  phase  of  the  signal  part  of  the  cross 
spectrum  changes  smoothly  with  frequency  and  since  its  ampli¬ 
tude  is  constant  we  do  not  require  very  fine  frequency  resolu¬ 
tion. 

We  will  use  4  jiSec  (16  data  samples)  as  the  duration  of  a 
segment.  This  choice  is  based  on  the  rapid  decay  of  the  cross 
correlation  function  (see  Appendix  B) .  The  use  of  the  short 
data  segments  is  not  the  technique  used  in  the  Mark  III  cor¬ 
relator,  but  this  concept  of  short  segments  will  provide  a 
convenient  method  for  developing  and  studying  the  actual 
methods  used  by  the  correlator  to  estimate  the  cross  spectral 
components.  In  Appendix  B  we  will  investigate,  in  detail,  the 
method  used  by  the  correlator  and  the  approximations  made  in 
developing  the  method.  The  4  nsec  segment  allows  eight 
complex  components  of  the  cross  spectrum  to  be  computed. 

These  components  are  spaced  at  0.25  MHz  intervals  over  the  2 
MHz  bandwidth.  The  cross  spectral  components  computed  from 
all  of  the  data  segments  then  form  the  input  for  the  delay 

resolution  function.  (For  the  sign  convention  adopted  in 

_  * 

deriving  the  delay  resolution  function,  would  be  the 

complex  conjugate  of  the  cross  spectrum  computed  from  the 

Fourier  transform  of  the  cross  correlation  function  defined  by 

Jx^ (t) %2  (t+t)dt,  where  x^(t)  and  x2(t)  are  the  data  from  each 
-  -*  , 

site.  xix2  can  be  9enerate<3  from  the  Fourier  transform  of 
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/x1  ( t)  x 2  (t-T)dt)  . 

For  a  given  frequency  we  can  consider  the  behavior  of  the 

cross-spectral  phase  with  time.  During  the  4  nsec  between  the 

midpoints  of  successive  intervals  used  for  cross  spectral 

determinations,  the  phase  of  the  signal  part  will  change  by  a 

maximum  of  23°  due  to  the  rotation  of  the  earth  (corresponding 

to  a  phase  delay  rate  of  2x10  6  sec/sec  and  an  observing 

frequency  of  8  GHz) .  In  the  current  formulation,  for  each  set 

of  search  parameters,  we  sum  over  all  BT  components  of  the 

cross  spectrum.  By  improving  the  estimate  of  the  phase  delay 

rate,  how  much  do  we  expect  to  improve  the  estimate  of  the 

phase  change  over  4  nsec?  The  a  priori  phase  delay  rate  is 

normally  known  to  better  than  10-11  sec/sec  and  hence  the 

improvement  in  the  estimate  of  the  phase  change  over  4  nsec 

o 

would  be  less  than  0.0001.  This  small  improvement  in  the 
value  of  the  phase  change  would  imply  that  we  need  only 
compute  the  change  once  (from  a  priori  information)  rather 
than  for  each  set  of  search  values.  Actually  the  improvement 
is  so  small  we  could  sum  together  several  thousand  estimates 
of  the  cross  spectrum  using  just  a  priori  information  without 
introducing  a  significant  amount  of  "phase  smearing."  This 
approach  leads  to  major  saving  in  computation  time.  We  are, 
in  effect,  removing  the  Doppler  shift  between  the  spectra  of 
the  data  from  each  site.  The  interval  of  time  over  which  the 
cross  spectral  components  from  segments  of  data  are  summed 
together  using  only  a  priori  information  is  called  an  accumu¬ 
lation  period. 
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This  presummation  approach  is  used  in  VLBI  data  process¬ 


ing  but  it  is  not  implemented  as  I  have  described  (i.e.  by 
summing  cross  spectral  components) .  There  are  two  methods  of 
computing  a  cross  spectrum,  either  by  finding  the  spectrum  of 
each  data  stream  and  multiplying  the  spectra  together  or  by 
cross  correlating  the  data  streams  and  determining  the 
spectrum  of  the  cross  correlation  function.  The  Mark  III 
correlator  uses  the  second  approach.  The  presummation  is 
implemented  by  summing  together  the  complex  cross  correlation 
functions  and  then  finding  the  spectrum  of  the  summed  cross 
correlation  function  (see  Appendix  B  for  details) .  The 
results  obtained  from  these  two  techniques  are  not  exactly 
equal.  Their  differences  will  be  discussed  in  Appendix  B. 

The  Mark  III  correlator  computes  the  complex  cross  cor¬ 
relation  coefficents  for  a  set  of  eight  lags.  The  cross 
correlation  coefficients  from  each  accumulation  period  are 
Fourier  transformed  and  the  maximum  likelihood  estimates  of 
the  VLBI  observables  are  determined  by  a  program  called  FRNGE 

The  presummation  may  be  included  in  Equation  (A. 8)  by 
writing  the  equation  as 


max 
wrt  5 


QJ 

l  Re{Zae"lfltq} 

q=i 


(A. 11 


where  Za  = 


n(q) 


r 

j*i“ 


^°nl^  j  ^°n2^  j 


X  X*e“i<aaP^ 

X1 jX2 je 


n(q)  is  the  number  of  estimates  of  the  cross  spectral  compo¬ 
nents  at  each  frequency  in  the  q  th  accumulation  period  of 
data,  Q  is  the  number  of  accumulation  periods  in  the  observa- 
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tion,  J  is  the  number  of  frequencies  at  which  the  cross 

spectrum  has  been  computed  and  (aap) j  is  the  a  priori  value  of 

(a) . -w  )  t  +  +  u  it  • .  The  multiplication  of  the  cross 

jog  t  DPD 

spectrum  by  exp(-i(a  )  .)  is  called  counter  rotation.  The 

ap  ] 

5q's  now  represent  corrections  to  the  a  priori  values  of  xg , 

4>t  and  Xp.  The  summation  over  j  is  restricted  to  cross 
spectral  components  at  one  frequency.  (In  Equation  (A. 11)  we 
are  using  one  summation  index  (cj)  to  represent  summations  over 
both  frequency  and  time.  The  single  summation  index  has  been 
chosen  to  simplify  the  equations.) 

We  have  now  developed  all  the  formulas  necessary  to  show 
equivalence  of  the  ML  and  least  squares  estimators.  The  ML 
estimator  is  represented  by  Equation  (A. 11) .  We  may  rewrite 
this  equation  as 

r.  i  i  (ib  -3  )  , 

E  Re{  re  '*q  q' } 

q 

q=l 

where  rg  and  <bg  are  the  magnitude  and  argument  of  Zg.  From 
each  accumulation  period  we  will  obtain  one  estimate  of  this 
magnitude  and  argument  at  each  frequency  in  the  cross  spectrum 
which  has  been  evaluated.  Taking  the  real  component  of  the 


above  expression  we  obtain 


E  r  cos (ib  -  3  ) 
q=l  q  q  q 


(A. 12) 


for  the  expression  to  be  maximized.  When  Equation  (A. 12)  is 
near  its  maximum  value  Bg  will  approximately  equal  <l>g  and  we 
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may  expand  (A. 12)  in  series  which  yields 


Uj  2 

Z  r  (1  -  (4»  -  5  )  /2  +  higher  order  terms) 
q*l  q  q  q 

Clearly  the  above  expression  may  be  maximized  by  minimizing 
QJ  2 

E  rn 

q-1  q  q  q 

But  this  minimization  is  simply  the  weighted  least  squares 
condition  with  the  rq's  being  interpreted  as  weights. 

We  would  expect  the  rq's  t0  be  inversely  proportional  to 
the  variances  of  the  phases  of  the  cross  spectrum  4^.  To 
investigate  this  expectation  we  return  to  the  form  of  Za. 

From  Equation  (A. 11)  we  have 


<r?K 


a  n  (3) 

l2ql  -  I.*  _ -2 

H  3*1 


-  «*  -i(“ap)i  , 

T— I  8  1 


(A. 13) 


We  will  investigate  this  equation  in  two  parts,  firstly  the 

variance  term  and  secondly  the  cross  spectral  term. 

2  2  2 

The  variance  term  is  OjK/C  (cn^)  j  (c^)  j)  where 
.2  _  «’.>i<»„l>1«,n2)* 


'  +  <Vj(0n2>j  +  K  <V  j  “m*  J 

and  (°g)jr  (°ni)j  and  (°n2) j  are  the  variances  of  the  real  or 

imaginary  components  of  the  spectra  of  the  signal  at  the  first 

site  and  the  noise  at  each  site.  The  constant  K  accounts  for 

the  difference  in  the  amplitude  of  the  signal  at  the  output  of 

the  receiver  between  the  first  and  the  second  site.  These 

2  2 

variances  are  given  by  iaa) ^  *  Ta^/2,  (°ni)j  *  Ts^/2  and 


(<*n2)j  =  Ts2/2.  K  is  equal  to  /Ta2/Ta^. 

For  nearly  all  VLB1  observations,  of  nearly  all  extra- 
galactic  radio  sources,  the  antenna  temperatures  are  much  less 
than  the  system  temperatures.  Most  radio  telescope,  source 
combinations  have  Ta's  <  1  K,  whereas  currently  available 
wideband  receivers  have  system  temperatures  between  100  K  and 
200  K.  Even  with  modern  cooled  wideband  receivers  it  is 
doubtful  that  system  temperatures  will  drop  much  below  20  K. 
Assuming  that  Ta<<Ts  the  expression  for  a j  reduces  to 

°j  -  <Vj  ■  TV2 


Incorporating  the  other  variance  terms  we  obtain 


2„ 


2/Ta1Ta2 

Ts1Ts2 


(A. 14) 


The  above  term  is  considered  independent  of  both  j  and  q  in 
the  algorithms  used  by  FRNGE. 

To  evaluate  the  remaining  part  of  rg  it  is  convenient  to 
make  one  further  assumption.  We  assume  that  variance  terms 
given  above  are  constant  over  the  duration  of  an  accumulation 
period,  i.e.  for  a  period  typically  less  than  2  seconds.  We 
may  then  write 


i  K  i  -  i 


2k 
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(A. 15) 


We  will  define  the  coherently  averaged  cross  spectrum,  5a,  as 


_  n(q)  * 

Ca  =  r  X,  .X 
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(A. 16) 
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We  should  note  that  C®  is  the  conjugate  of  the  summation  in 
Equation  (A. 15);  see  discussion  after  Equation  (A. 10) . 

From  Equation  (A. 5)  we  have 


*!j*2j  e  '“aP'5'+  SN’e 


— *  -i(aap)j 


*  i  (a.- (a  )  +)  *-i(a  ). 

KS  Nj^e  3  ap  3  +  N^e  ap  3 


(A. 17) 


The  magnitudes  of  these  terms  are,  respectively,  |K55  |  ■ 
/Ta-^Taj ,  |SN2|  *  /rajTs2  J  KS  |  =  /^a^Ts^  and  |N^N2|  = 
/Ts^TSj  •.  If  ^  a  ^ap'  fc^en  as  we  sun*  the  n(q)  values  in  the 
accumulation  period,  the  signal  term  K§S*exp[ i ( a^- (a  _).)] 
will  have  constant  phase  and  the  net  signal  magnitude  will  be 
n (q) /Ta1Ta2.  The  noise  terms  on  the  other  hand  will  have 
random  phases  for  each  value  in  the  summation  and  the  contrib¬ 
ution  of  the  noise  components  will  increase  only  as  the  /n(q) . 
The  relationship  between  the  terms  in  the  coherently  averaged 
cross  spectra  is  represented  graphically  in  Figure  A. 2.  The 
variance  of  the  amplitude  of  the  noise  terms  will  be  Rayleigh 
distributed  (Whitney,  1974,  Section  3.2).  When  the  real  and 

imaginary  components  of  the  noise  terms  have  equal  variance 

2  2 
(o  ),  the  variance  of  their  amplitude  will  be  (2-n/2)0  . 

From  this  figure  we  can  also  estimate  the  uncertainty  in 
the  phase  of  the  cross  spectrum.  Since  Ta<<Ts,  the  dominant 
noise  term  will  be  due  to  the  product  of  the  system  noise 
terms  for  the  various  sites.  Further,  assuming  that  n(q)  is 
sufficiently  large  that  /Ta^Ta^  >>  /Ts^Ts^/n (q) ,  we  may 


Figure  A, 2  Representation  of  the  components  of  the  cross  spectrum 
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=  the  coherently  averaged 
cross  spectrum 


An(q)  / 2)  (Ta2Ts1+Ta1Ts2+Ts1Ts2)  (2-tt/2> 


project  the  noise  term  in  a  direction  normal  to  the  signal 
term;  it  will  have  a  variance  of  n (q) (TajTs^+Ta^TSj+Ts^TSj) /2 
The  variance  of  the  phase  will  be 


2  Ts.Ts 

a.  *  _ 1  2 

^  2  n(q)  Ta^Ta2 


(A. 18) 


(Note:  n(q)  can  nearly  always  be  made  sufficently  large  to 
satisfy  this  last  assumption  because  we  can  always  repeat  the 
correlation  process  with  an  updated  phase  delay  rate  if  the 
initial  phase  delay  rate  is  not  sufficiently  accurate  to  allow 
n(q)  to  be  selected  large  enough.) 

Also  from  Figure  (A. 2)  we  see  that 


n(q) 


*  e'^ap’j 


*  n  (q)  ✓Tap'a^ 


Combining  the  above  result  with  Equation  (A. 14)  we  have  that 


rq  “  2TalTa2  n (q)  (A. 19) 

Ts1Ts2 

which  by  comparing  with  Equation  (A. 18)  is  seen  to  be  inverse¬ 
ly  proportional  to  the  variance  of  the  phase  of  the  cross 
spectrum,  as  expected.  This  completes  the  proof  of  equival¬ 
ence  of  the  maximum  likelihood  estimator  and  weighted  least 
squares . 

The  above  result  was  not  unexpected  since  it  can  be 
proved  generally  for  linear  estimation  problems  with  Gaussian 
statistics  that  the  maximum  likelihood  and  least  squares 
estimators  are  equivalent.  It  should  be  noted  that  the  equiv¬ 
alence  is  valid  only  when  the  linearization  in  Equation  (A. 12) 
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is  valid.  (In  Section  3.1  we  saw  that  the  phase  differences 
between  the  model  and  the  observed  cross  spectral  phases  are 
sufficently  small  for  the  linearization  of  Equation  (A. 12)  to 
be  valid.) 

The  FRNGE  algorithms  and  least  squares  are  equivalent 
when  the  system  and  antenna  temperatures  are  independent  of 
frequency  and  time.  Now  that  we  know  where  the  FRNGE  algor¬ 
ithms  are  deficient  we  can  use  the  least  squares  approach  to 
correct  the  results  from  FRNGE  if  there  is  evidence  that  the 
system  and  antenna  temperatures  are  not  independent  of  fre¬ 
quency  and  time. 

The  least  squares  approach  also  provides  an  excellent 
method  of  visualizing  how  the  VLBI  observables  are  estimated. 
The  quantity  to  be  minimized  is  the  weighted  sum  of  the 
squares  of  (^-“g)  where  the  (J^'s  are  the  observed  phases  of 
the  cross  spectrum  and  5^  is  simply  a  mathematical  model  of 
the  dependence  of  these  phases  on  the  group  delay,  visibility 
phase  and  phase  delay  rate  (Equation  (A. 3)).  The  VLBI 
measurements  are  made  by  least  squares  fitting  of  straight 
lines  to  the  observed  phases  as  a  function  of  frequency  and 
time.  The  slope  of  the  phase  versus  frequency  at  a  given 
epoch  is  the  group  delay  referred  to  that  epcoh;  the  slope  of 
the  phase  versus  time  multiplied  by  frequency  is  the  phase 
delay  rate,  and  the  intercept  of  these  lines  at  a  given  epoch 
and  frequency  is  the  visibility  phase.  The  visibility  phase 
divided  by  the  frequency  to  which  it  is  referred  (wQ)  is  the 
(ambiguous)  phase  delay. 


Finally,  before  leaving  this  appendix  we  ask  why  FRNGE 
uses  a  maximization  approach  rather  than  the  least  squares 
estimator.  Figure  (A. 3)  shows  the  sum  of  the  squares  of  the 

mm 

residuals  (4>  -S „)  when  l  is  varied  and  and  -t  are  esti- 

q  q  g  t  p 

mated  to  produce  the  mininum  sum  of  the  squares  of  the  residu¬ 
al  phases,  i.e.  we  carried  out  a  series  of  least  square  fits 
to  "hypothetical"  cross  spectral  phases  but  with  the  group 
delay  constrained  to  a  series  of  values  between  -20  and  20 
nsec.  (See  Appendix  E  for  a  discussion  on  "constrained"  least 
squares.)  To  produce  this  figure  we  assumed  that  the  suc¬ 
cession  of  four  frequencies  in  the  synthesized  band  were 
separated  by  100,  250,  and  300  MHz.  (A  review  of  the 
equations  for  the  model  used  to  fit  the  group  delay  to  the 
cross  spectral  phases.  Equation  (A. 3) ,  will  show  that  the  sum 
of  the  squares  of  the  phase  residuals  does  not  depend  on  the 
absolute  frequency  of  each  channel  when  $t  is  estimated  as 
is  varied.)  It  is  clear  that  there  are  a  number  of  local 
minima,  to  any  one  of  which  the  least  squares  may  converge 
(remember,  we  are  dealing  with  a  nonlinear  problem  and  hence 
we  would  need  to  iterate  the  least  squares  solution) .  So  even 
with  the  least  squares  approach  it  would  still  be  necessary  to 
search  over  many  trial  values  to  ensure  that  the  global  mini¬ 
mum  was  found.  However,  once  the  region  of  the  correct 
minimum  is  located  the  least  squares  estimator  is  an  excellent 
way  to  estimate  the  values  of  the  VLBI  observables. 

We  should  note  in  Figure  A. 3  that  after  the  group  delay 
has  been  changed  by  20  nsec,  the  sum  of  the  squares  of  the 


value  of  group  delay  (nsec) 
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residual  cross  spectral  phases  is  zero.  This  perfect  fit  of 
the  model  (Equation  (A. 3))  to  the  observations  ocurrs  because, 
for  the  the  sequence  of  frequencies  used  to  produce  Figure 

A.  3,  a  change  of  20  nsec  in  the  group  delay  will  change  the 
model  phases  at  each  frequency  by  exact  multiples  of  2 tc 
radians.  The  spacing  between  the  group  delay  values  which 
fulfill  this  relationship  is  called  the  group  delay  ambiguity. 
Its  value  is  given  by  the  inverse  of  the  largest  common 
denominator  of  the  frequency  spacings.  It  may  appear  that  the 
group  delay  ambiguities  could  be  resolved  because  the  fits  of 
the  group  delay  to  the  cross  spectral  phases,  within  each  2 
MHz  bandwidth  channel,  will  be  affected  by  changes  in  the 
group  delay  by  multiples  of  20  nsec.  This  is  not  the  case, 
however,  because  in  the  implementation  of  the  maximization  of 
the  delay  resolution  function,  two  distinct  group  delays  are 
estimated:  the  multiband  group  delay  and  the  singleband  group 
delay.  The  multiband  group  delay  is  estimated  from  the  change 
in  cross  spectral  phase  between  the  (widely  spaced)  channels. 
The  singleband  group  delay  is  estimated  from  the  change  in 
phase  within  the  2  MHz  channels.  We  discuss  these  two  delay 
measures  in  detail  in  Section  2.2  and  Appendix  B.3  (Equation 

B. 3.3),  (see  also  Whitney,  1974,  pp  156-158). 


! 
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Appendix  B.  Evaluation  of  the  delay  resolution  function 


In  Appendix  A  we  developed  the  maximum  likelihood  esti¬ 
mator  for  VLB  I  observables.  We  assumed  that  the  cross 
spectrum  of  the  signals  received  at  each  site  was  available  in 
the  radio  frequency  (RF)  band,  i.e.  at  frequencies  around  8 
GHz  for  X-band  and  2  GHz  for  S-band.  However,  in  Chapter  2  we 
saw  that  the  signals  were  recorded  at  each  site  after  the  RF 
signals  had  been  mixed  with  a  local  oscillator  (LO)  signal 
such  that  the  spectra  of  the  recorded  signals  were  shifted  in 
frequency  by  the  LO  frequency.  In  this  appendix  we 
investigate  the  relationship  between  the  spectra  of  the 
recorded  ("video")  signals,  and  the  spectra  of  the  signals 
before  they  were  heterodyned  (Section  B.l) .  We  will  then 
investigate  the  cross  correlation  function  of  the  video 
signals  and  compare  this  cross  correlation  function  with  the 
cross  correlation  function  which  would  be  obtained  if  the 
signals  at  RF  were  recorded  (Section  B.2).  Once  we  have 
investigated  the  cross  correlation  functions,  we  will  derive 
the  algorithms  used  by  the  program  "FRNGE"  to  estimate  the 
cross  spectral  components  of  the  recorded  data  and  to  obtain 
the  maximum  likelihood  estimates  of  the  group  delay,  phase 
delay  rate  and  visibility  phase  (Section  B.3) .  Finally  we 
will  investigate  the  effects  of  the  approximations  made  in  the 
correlator  algorithms  on  the  group  delay,  phase  delay  and 
phase  delay  rate  estimates  (Section  B.4) . 

Firstly,  to  avoid  any  confusion  concerning  sign  conven- 
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tions  and  definitions  of  cross  correlation  functions  and  cross 
spectra ,  we  describe  these  quantities  in  Table  B.l.  An  impor¬ 
tant  point  to  note  is  that  the  delay  resolution  function 
requires  the  complex  conjugate  of  the  cross  spectrum  (see 
discussion  concerning  Equation  (A. 10)),  which  may  be  obtained 
by  correlating  with  the  sign  of  the  lag  (?)  opposite  to  that 
shown  in  Table  B.l  (we  will  discuss  this  topic  further  in 
Section  B.2) .  We  have  continued  to  use  the  complex  conjugate 
for  consistency  with  earlier  works  (Rogers,  1971;  Whitney, 
1974;  Whitney  et  al. ,  1976)  .  (Part  of  the  confusion  about 
conjugates  can  also  be  attributed  to  the  cross  spectrum  C^^tu) 

being  the  complex  conjugate  of  the  cross  spectrum  defined  in 

_ 

Table  B.l,  i.e.  C12 (w) sC2i (“) •  for  r®al  signals.) 


Relationship  between  the  video  and  RF  cross  spectra 


In  Chapter  2  we  saw  that  the  spectrum  of  the  upper  side¬ 
band  (USB)  video  signals  were  related  to  the  spectrum  of  the 
RF  signals  by  (see  Equation  (2.1.1)), 
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where  S^(wv)  is  the  spectrum  of  the  video  signals  or  simply 
video  spectrum,  S^(o)v±u^)  is  the  spectrum  of  the  RF  signal, 
is  the  sum  of  the  frequencies  of  the  first  and  second  local 
oscillators,  and  4^  is  the  sum  of  the  phases  of  the  local 
oscillators  for  the  upper  sideband.  For  the  lower  sideband 


Table  B.l  Definitions  of  the  sign  conventions  for  dela 
cross  correlation  function  and  cross  sprectrum. 


Delay  Tg  -  fc2  ~  fcl  ' 

where  t^  and  t2  are  the  travel  times 
from  the  source  to  each  station 
(In  this  table  will  be  assumed 
constant  and  equal  to  the  phase  delay  )  B 

The  relationship  between  signals  recorded  at 

each  site  is  (neglecting  noise) 

sx(t)  =  s  2  ( t+  t  ) 
or  y 

S2(t)  =  Sl(t_tg)  * 

The  spectra  of  the  signals  will  be 

CD 

S1(u.)  =  /  Sl(t)e"ia>tdt 

—  CO 

and 

®  .  . 

S2  (  u>)  =  /  s2  ( t)  e~  “  dt 


From  the  definition  of  i  ,  these  spectra  will  be  related  by 
§2  (to)  =  S^to)  e”1  WTg 


The  cross  correlation  function  R12(T)  *s 


R12(t)  =  /  ( t-x) s2 (t) dt  =  J  sx  (t) s2  (t+x)dt 


The  cross  spectrum,  C^2  (<*>)»  of  the  signals  from  each  site  is 
00  . 

C12(u>)  =  /  R12(t)  e'luT  d-t 
—  00 

From  the  definition  of  the  cross  spectrum  we  have 

C12(w)  =  S*(u>)S(u>)  =  5*  (wjSj^  (w)  e“i£0Tg  =  C21(w) 

The  delay  resolution  function  requires,  as  input,  the  complex 
conjugate  of  C12(u>),  as  defined  in  this  table  (see  text). 


(LSB)  video  signals,  the  relationship  between  the  video 


spectrum  and  the  RF  spectrum  is  (see  Equation  (2.1.2)), 


5*<V 

-  S1(wv-w1) 

e  U1  , 

for 

A 

> 

3 

0 

*  S1(mv+“1) 

i$* 

e  V1  , 

for 

V 

> 

3 

0, 

-1  Ji 

where  S^(wv)  is  the  LSB  video  spectrum  and  the  sum  of  the 

phases  of  the  local  oscillators  for  the  lower  sideband.  The 

same  local  oscillator  is  used  to  heterodyne  both  upper  and 

lower  sidebands  from  IF  to  video,  but  since  the  USB  and  LSB 

signals  propagate  through  different  circuits  in  the  single 

sideband  mixer  (see  Section  2.1)  ,  they  could  have  different 

phase  shifts  added  to  them.  Hence  we  use  the  two  different 

n  0 

phases  and  The  specifications  for  the  video  converters 

indicate  that  the  difference  in  phase  of  the  upper  and  lower 

o 

sidebands  should  be  less  than  5  (Mark  III  System  Documenta¬ 
tion,  1980,  p.  VC-8). 


At  the 

second  site 

of  the 

interferometer 

similar 

rela- 

tionships  exist  between 

the  video 

spectra  and 

the  RF 

spectrum: 

§2<V 

*  §2(VW2) 

e  2 

9 

for  o>v 

>  o. 

(B.l.l.a) 

3 

IV} 

*  §2(<W 

e-”S 

9 

for  w 

V 

<  o. 

(B.l.l.b) 

and 

S*(o>  ) 

*  §  (u>  -w  ) 

e’H2 

9 

for  « 

>  o. 

(B.l.l.c) 

2  v 

Bf(«  ) 

2  v 

2  v  2 

■  vw 

e  ^2 

9 

V 

for  u 

V 

<  0. 

(B.l.l.d) 

We  now  develop  the  relationship  between  the  video  spectra 
of  the  data  from  each  site.  In  Appendix  A  we  derived  the 


relationship  between  the  spectra  for  signals  at  RF.  From 
Equation  (A. 4)  we  have 

S  2  ( )  =  ^(w)  exp[-i  (u>-wQ)  xg  -  i$t  -  iio-fpAt] 

where  is  the  frequency  to  which  the  visibility  phase, 
is  referred,  and  At  is  the  time  difference  between  the  epoch 
of  the  observation  (i.e.,  the  time  to  which  and  are 
referred)  and  the  time  of  the  determination  of  the  cross 
spectral  components.  (Note  that  the  components  of  the  cross 
spectrum  at  a  specific  time  can  be  determined  from  a  4  jxsec 
segment  of  data.)  If  we  take  w  equal  to  w  +u>^  in  Equation 
( A. 4 ) ,  we  obtain 

S2(o>v+w1)  =  Sx  (o^+w^expf-i  (  (wv+w1-u>o)  tg+  +  (u^+u^)  tpAt]  )  . 
But 

n  i  *u 

S1((Dv+a)1)  =  s“(u>v)  e"lvl  ,  for  USB  and  u>v>  0 

and 

l 

Si<w  +wi)  =  S^(tov)  e  *^1  ,  for  LSB  and  o>v<  0. 

Hence 

S2(Wv+wi)  *  ®i  exp^- *  l  (u>v+w^-<i>0)  Tg+  ^t+  (a)v  +wi)^pAt 

+  )  }  ,  for  USB,  u>v>  0 

and 

-  -A 

S2(u,v+U>l)  *  S^^leitpt-iKY^-Wjly  ♦t+  («*>v  +u>1)'tpAt 

+  $*  )  }  ,  for  LSB,  o)y<  0. 

On  the  left  hand  side  of  the  equations  we  still  have  the 
RF  spectra  of  the  signals  from  the  second  site.  To  relate 
these  RF  spectra  to  the  video  spectra  we  again  need  to  trans¬ 
late  the  spectra  by  the  local  oscillator  frequencies.  Hence 
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the  spectra  of  the  signals  at  site  2  at  video  frequencies 
become 

u  -iau 

§2  (u>v+  *  Sj  (u>v+  Uj)  e  w2  ,  for  USB,  u»v>  0 

and 

—  —X  _i*^ 

S2  («v+  w^)  *  Sj  (wy+  w^-  Uj)  e  w2  ,  for  LSB,  u>v<  0. 

Therefore  the  relationship  between  the  video  spectra  of 
the  recorded  data  at  each  site  becomes 

§2(wv+wl"u,2)  *  Sl(wv)  exPf-i  I  (“v+“1-“0>  Tg+  ♦t 

+  (w^w^t  At  +  »  for  USB ,  wy>  0 

and 

Sj  (Wy+u^-Uj)  *  S*(wv)  exp{-i  [  (&)v+u>1-w0)  Tg+  *t 

+  (Uy+i^)  xpAt  +  (♦£-♦£)]}  ,  for  LSB ,  «v<  0 

In  the  equations  given  above  the  video  spectra  from  each 
site  are  referred  to  different  frequencies  («*>v  for  site  1  and 
ta)v+wl~“2  ^or  s*te  2).  However,  and  Wj  are  usually  quite 
closely  matched  since  they  are  derived  from  hydrogen  maser 
frequency  standards.  (Even  if  these  frequencies  are  not 
closely  matched,  it  is  unlikely  that  they  will  be  greater  than 
the  frequency  offset  due  to  the  phase  delay  rate;  see  comments 
below.)  For  the  data  we  will  be  analysing  the  maximum 
frequency  offset  between  the  hydrogen  masers  will  be  0.1  Hz  at 
X-band  (for  the  Effelsberg  hydrogen  maser  during  the  July, 

1980  experiment) .  More  typical  values  for  frequency  differ¬ 
ences  are  0.01  Hz  (again  at  X-band) .  These  frequency  differ¬ 
ences  are  very  small  compared  to  the  frequency  spacings  to  be 
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considered  in  the  cross  spectrum  (0.25  MHz)  and  the  frequency 
offset  due  to  the  rotation  of  the  Earth  (this  latter  offset  is 
implicitly  contained  in  the  phase  delay  rate  term,  and  may  be 
as  high  as  16.8  kHz  at  X-band) .  We  can  change  the  frequency 
offset  of  the  video  signals  at  the  second  site  by  introducing 
a  time  varying  phase  term  similar  to  the  phase  delay  rate 
term. 

To  see  how  this  term  may  be  introduced,  we  consider  the 
calculation  of  the  video  cross  spectrum.  The  video  spectrum 
at  the  second  site  is  computed  from  a  finite  duration  of  data 
(AT)  by 

^2  (u,v+a>1~u>2)  =  /  s2(t)  e~la>vt  e'1(wrw2)t  dt 
AT 

If  ( a)2~w2 )  AT  <<  1  (AT  must  be  selected  to  fullfil  this  condi¬ 
tion)  ,  then  the  (Wj-u^t  term  may  be  treated  as  a  constant 
during  the  integration  which  would  yield 

B2(wv+Ml“w2)  *  ®2  (u,v)  e"l(wl"w2)t. 

Substituting  this  equation  into  the  relationship  between  the 
video  spectra  we  obtain 


§2(V 


sU 


S i  ( u>y)  exp{-i  [  (u)v+«1-u)0)  +  ♦j.  +  (“>v+<*>1)  *pAt 

+  (♦“-♦J)  -  («f“2)t)),  for  USB,  wv>  0 

(B.  1. 2. a) 


and 


S2«V 


5* 


Sf(«v>  exp{  -  i  t  (u)v+W^-0)o)  Tg  +  ♦^  +  (wv+U^)  TpAt 


+  ■  Uj-Wj)  }»  for  LSBr  u,v< 


(B.l. 2.b) 


The  expressions  for  the  relationship  between  the  video 


spectra  for  USB  with  wy  <  0  and  for  LSB  with  >  0  may 
generated  from  Equation  (A. 4)  by  substituting  «  -  fa>v~ui 
Using  the  same  procedures  which  led  to  Equations  (B.1.2 
obtain 


3i(«v)  exp(-i  Tg  -  4t  + 

-  (+”-+£)  +  for  0SB< 


(B 

and 

§2(wv)  *  3£(wv)  exp{-i  xg  -  4t  +  (uv-u>1)t] 

-  (♦f-ij)  +  (Wj-ttjJtJ},  for  LSB 

(B 

If  we  group  all  the  local  oscillator  terms  together  by 
defining 

A4  (t)  2  ^  (<o^—  ***2)  ^ 

and 

A**(t)  2  ( 4>2~ <*>1  >  +  ' 

then  Equations  (B.1.2)  and  (B.1.3)  become 
Bjfwy)  *  3^  (o)y)  exp{-i  [o)v±w^+w0)  Tg  T 

±  A4U(t)  ]  }  , 

and 

s£(uv)  *  §^(wv)  exp{-i(Wv+Wl±u0)  Tg  ± 

±  A4>°  (t) )  )  , 


♦t  +  (wv±w1)t] 
for  USB f  (B. 

♦t  +  * 

for  LSB,  (B. 


be 

we 

wv<  0 

1.3.  a) 

At 

wv>  0 

1. 3. b) 


At 

1. 4.  a) 

At 

1.4. b) 


where  upper  signs  refer  to  uy  >  0  and  lower  signs  to  «v  <  0 


From  Equations  (B.1.4)  we  see  that  the  phase  changes  due  to 
the  phase  delay  rate  and  the  oscillator  frequency  difference 
terms  have  opposite  sign  for  the  upper  and  lower  sidebands 
(remember  >>  wv  and  is  always  positive) .  We  will  see 
later  in  this  appendix  that  this  sign  reversal  causes  some  of 
the  errors  introduced  by  these  time  varying  phase  terms,  to 
cancel  when  upper  and  lower  side  band  data  are  combined. 

The  video  cross  spectrum  will  be,  for  upper  sideband 

Si(u,v)52*  (wv)  =  l5l(wv)l  exP(  +  i[  <“\,±aVa,0>Tg  4  *t 

+  (a)v±a>1 )  *  At  ±  A$U  ( t)  )  }  (B.1.5.a) 

and  for  lower  side  band 


(Wv)  |  exp(  +  i[(Uv+u,1±0)o)xg  * 


+  (u)  +(jj.)xAt  +  A  4  ( t)  ]  }  •  (B.  1 . 5.b) 

V  X  P 


If  we  compare  these  expressions,  with  those  for  the  cross 
spectrum  of  the  signals  at  RF,  we  see  that  the  video  and  RF 
cross  spectra  are  related  by 

u 


§1  (<‘,v)§2*  ("v1  =  §i(uv±ul)§2(u,v±u,2)  J  »  for  USB 


(B.1.6.a) 


and 


sJ(wv)S2* (wy)  -  Sx (wv+u1)S*(wv?w2)  expf  *i A$* ( t) ] ,  for  LSB. 


(B.i.e.b) 


Hence,  whenever  the  delay  resolution  function  requires 
the  cross  spectrum  at  RF,  we  can  substitute  the  video  cross 
spectrum.  However,  when  the  video  cross  spectrum  is  used 
there  is  an  additional  time  varying  term  introduced  because  of 


the  difference  in  oscillator  frequencies.  We  should  note  that 
the  delay  resolution  function  only  requires  the  positive 
frequency  components  from  the  cross  spectrum  (because  only  the 
independent  cross  spectral  components  were  used  to  derive  the 
delay  resolution  function;  see  Appendix  A) ,  and  hence  the 
cross  spectrum  will  only  need  to  be  computed  at  positive  video 
frequencies  for  USB  and  negative  video  frequencies  for  LSB. 

Before  developing  the  correlator  algorithms  it  will  be 
instructive  to  compute  the  cross  correlation  function  of 
signals  (without  noise)  recorded  at  video  frequencies.  We  may 
find  the  cross  correlation  function  by  taking  the  Fourier 
transform  of  the  cross  spectrum  of  the  signals.  The  cross 
correlation  function  will  be 

2xB  * 

R12(t)  *2^bS1  (ta,v)S2(wv)  el“yT  d“v  ' 

where  S^(u>v)  and  SjCwy)  are  used  to  denote  either  the  upper 
or  the  lower  sideband  video  spectrum.  The  complex  conjugates 
of  Equations  (B.1.5.a  and  b)  allow  us  to  express  the  integral 
above  in  terms  of  the  power  spectra  | ( <*>v)  |  and  |5*(uv)|« 

For  upper  sideband  data  we  have 


-  A*u(t)]}  eiwvT  dwy 


where  we  have  split  the  integration  into  separate  integrals 


for  positive  and  negative  frequencies  because  the  expressions 
for  the  cross  spectra  are  different  in  these  regions.  In 
Chapter  2  we  saw  that  the  power  spectrum  of  the  signals  should 
be  constant  over  a  2  MHz  bandwidth.  If  we  assume  this  is  the 
case,  then  we  can  integrate  the  expression  for  After 

some  algebra  we  obtain 

R^2(t)  *  COS{  (o)1-u)0)  xg  +  u^tpAt  +  *fc+  A*U(t)  +  itB(Tg+ipAt  -x)  } 

sin{nB(xg+  tpAt  -  x)}/[xB(xg+  tpAt  -  x )  J  .  (B.1.7) 

In  examining  this  cross  correlation  function  we  note  that 
the  time  lag  x  is  only  multiplied  by  the  bandwidth  B  whereas 
xg  (Table  B.l)  is  multiplied  by  both  the  bandwidth  and  the  RF 
frequency.  (Note  that  has  an  implicit  dependence  on  fre¬ 
quency  because  it  represents  the  phase  of  the  cross  spectrum 
at  the  reference  frequency  u)Q,  i.e.  =  t*>oxp)  .  Consequently 
as  the  lag  x  between  the  tapes  is  changed  the  arguments  in 
Equation  (B.1.7)  will  change  slowly,  whereas  if  either  the 
group  or  phase  delay  changes  these  arguments  will  change 
rapidly. 

It  is  instructive  to  compare  Equation  (B.1.7)  with  two 
other  cross  correlation  functions.  If  the  signals  at  RF  had 
been  recorded  (at  a  sampling  rate  of  samples/sec),  the 

cross  correlation  function  would  be 

R^2(x)  =  cos{(u,1-wo)xg+  u^tpAt  +  $t  -u^x  +  itB  ( xg+  *pAt  -  x)  } 

sin{*B(x  +  t  At  -  x)}/(xB(x  +  *  At  -  x)] 

9  r  7  r 

While  this  expression  looks  very  similar  to  Equation  (B.1.7) 


there  is  a  major  difference.  The  cross  correlation  function 
of  signals  recorded  at  RF  is  a  function  of  and  hence 
changes  very  rapidly  as  the  lag  between  the  tapes  is  changed. 

At  the  other  extreme,  if  the  signals  from  the  radio 
source  in  the  frequency  range  0-2  MHz  were  recorded,  their 
cross  correlation  function,  R^fx),  would  be 

R^2(x)  *  cos{sB(xg+  fpAt  -  x)  } 

sin{itB(Xg+  *pAt  -  T)}/[*B(Tg+  ipAt  -  x)  ] 

In  this  case  there  are  no  rapid  variations  in  the  cross  cor¬ 
relation  function  for  variations  in  either  x  or  x  . 

9 

The  cross  correlation  functions  given  above  are  shown 
graphically  in  Figure  B.1.1.  This  figure  highlights  the 
advantages  of  the  heterodyned  observations.  We  gain  the 
sensitivity  of  the  RF  recording  to  xg  and  retain  the  advan¬ 
tages  of  slow  sampling  rate  and  slow  variation  of  cross  cor¬ 
relation  function  of  0-2  MHz  recordings.  [The  loss  of  cor¬ 
relation  in  the  heterodyned  case  is  due  to  the  difference  in 
the  oscillator  phases,  which  appears  in  the  cosine  term  in 
Equation  (B.1.7).  The  oscillator  phases  add  directly  to  the 
cross  spectral  phases  at  all  frequencies  in  the  video  band¬ 
width,  i.e.  unlike  a  delay,  their  phase  (in  the  video 
spectrum)  does  not  depend  on  a  delay-like  quantity.  Conse¬ 
quently  these  phases  will  change  the  shape  of  the  video 
signals  at  each  site  (if  the  oscillator  phases  are  different) 
hence  decorrelating  the  signals.  This  loss  does  not  occur  if 
the  delay  resolution  function  is  used  (see  Rogers,  1970,  and 


Figure  B.1,1  Cross  correlation  functions 
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Section  B.4) . ] 

An  examination  of  the  cross  correlation  function  shown  in 
Figure  B.1.1  indicates  that  the  magnitude  of  the  cross  correl¬ 
ation  function  decays  rapidly  as  Tg+TpAt-t  deviates  from  zero. 
This  rapid  decay  allows  the  cross  spectrum  to  be  computed  from 
the  cross  correlation  function  estimated  at  a  small  number  of 
lags  about  the  a  priori  estimate  of  the  delay.  The  Mark  III 
correlator  evaluates  the  cross  correlation  function  at  eight 
lags  centered  on  the  bit  offset  closest  to  the  a  priori  estim¬ 
ate  of  the  delay.  The  lags  are  spaced  at  the  sampling  inter¬ 
val  (usually  0.25  ^sec) .  (Later  in  this  appendix  we  will 
discuss  the  effects  of  using  only  eight  lags.) 

B . 2  Correlator  algorithms 

We  will  now  derive  the  algorithms  used  by  the  Mark  III 
correlator  to  estimate  the  correlation  function  from  the  data 
recorded  on  the  tapes  during  an  observation.  Several  approx¬ 
imations  will  be  made  while  deriving  these  algorithms  and  we 
will  investigate  the  effects  of  these  approximations  on  the 
VLB I  measurements  in  Section  B.4. 

The  algorithms  will  be  derived  for  USB  data;  the  LSB 
algorithms  may  be  derived  using  the  same  procedures.  While 
deriving  the  correlator  algorithms  we  will  introduce  many 
symbols.  In  Table  B.2.1,  we  define  each  of  the  symbols  we 
will  be  using  frequently.  (This  table  is  meant  as  a  conveni¬ 
ent  reference,  more  exact  meanings  (if  necessary)  of  the 


Table  B.2.1  Summary  of  symbols 


X1 (t j  +  °f f set)  •  x2 (t j+of f set)  the  USB  signals  recorded  at  sites 
1  and  2. 

tj  the  epoch  of  the  j.th ,  4  nsec  data  segment. 

Q 

t  the  epoch  of  the  center  of  the  3th  accumulation 

period. 

tg  the  epoch  of  the  first  bit  in  the  <|th  accumulation 

period. 

t  the  epoch  of  the  corrections  to  the  a  priori  group 

delay,  phase  delay  rate  and  visibility  phase  (usual¬ 
ly  near  the  center  of  the  observation) . 

Atq  the  difference  in  the  epoch  of  the  3th  accumulation 

period,  t£,  and  t  . 

At  the  sampling  interval  (At  =  1/(2B)  =  0.25  nsec). 


N(q) 


the  number  of  bits  correlated  in  the  3th  accumula¬ 
tion  period. 

the  duration  of  a  short  segment  of  data  (4  nsec) 
the  duration  of  an  accumulation  period  (typically  2 
sec)  . 

the  duration  of  the  observation  (100-400  sec) . 


the  lag  between  the  tapes  during  correlation  (con¬ 
tinuous  variable). 

the  discrete  lag  index  (integral  multiples  of  At  )  ; 


Table  B.2.1  Continued 


ap 


ap 

/  b  , 

(’ap'j 


b 

'ap 


<*ap>q 


the  actual  lag  between  the  tapes  will  be  Axj+xj^ 

(see  below) . 

the  a  priori  value  of  x^  (see  Table  B.l);  in  most 

cases  x  will  be  a  function  of  time, 
ap 

the  a  priori  delay  at  the  center  of  the  observation. 

the  nearest  bit  shift  to  xgp  for  the  segment  of 

data  (at  the  center  of  the  segment) . 

(x^  )  .  when  reference  to  a  particular  segment  is  not 
ap  j 

needed. 

the  a  priori  value  of  the  phase  delay  rate  at  the 
center  of  the  cjth  accumulation  period. 


R^(AXi) 

R^(ATi) 

C  .  (w  ) 
j  v  v7 


Ca  (w  ) 
qr  v7 


the  USB  complex  cross  correlation  function  at  lag 
Ax  i  * 

the  LSB  complex  cross  correlation  function  at  lag 

Av 

the  video  cross  spectrum  of  the  j^th  segment  of  data 
(wv  takes  on  discrete  values  between  0.25  MHz  and 
1.75MHz).  Strictly,  this  is  the  conjugate  of  the 
cross  spectrum  as  defined  in  Table  B.l;  see  discus¬ 
sion  concerning  Table  B.l. 

the  video  cross  spectrum  coherently  averaged  over 
the  qth  accumulation  period,  £  represents  the  radio 
frequency  (RF)  of  the  channel  from  which  the  data 
were  taken  (see  comment  above) . 
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Table  B  .  2 . 1  Continued 
u  the  radio  frequency  of  u>v=0  (video  DC)  in  channel  rt 

u>o  the  radio  frequency  to  which  the  visibility  phase  is 

referred^ 

(o '  =wr +  i:B  the  radio  frequency  corresponding  to  the  center 
frequency  of  channel  r* 

B  the  recorded  bandwidth  (Hz) . 

and  6tp  the  trial  values  of  the  corrections  to  the  a 

priori  values  of  the  multiband  group  delay  and  phase 
delay  rate,  respectively. 

6rb  the  correction  to  the  a  priori  value  of  the  single 

band  delay. 

the  single  band  delay  index  (16  values  of  spaced 

at  half  the  sampling  interval,  it  ) . 

6x£  the  value  of  6x£  which  maximizes  the  multiband  delay 

function. 

Dqr(6x£)  the  single  band  delay  function  for  the  gth  accumula¬ 
tion  period  for  channel  z_;  see  Section  B.2. 

AC(w)  general  symbol  to  denote  the  integrated  cross 

spectrum  (exact  meaning  will  depend  on  context), 
general  symbol  to  denote  the  error  in  the  rate  of 
counter  rotation  of  the  cross  spectral  components. 

*  all  frequencies  denoted  by  u)  are  angular  frequency  measures 
(rad/sec)  . 


- '  -  Vw*  •  V  .*  W\ 
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symbols  will  be  given  in  the  text.)  Vfe  start  deriving  the 
algorithms  by  carrying  out  the  correlation  of  the  data  in 
short  segments  (each  of  4  nsec  duration,  i.e.  16  data 
samples) .  Later  we  will  see  that,  in  practice,  it  is  not 
necessary  to  correlate  the  data  in  this  fashion.  By  forming  a 
complex  correlation  function,  we  can  correlate  long  segments 
of  data  (of  up  to  2  sec  duration) .  These  long  segments  are 
called  accumulation  periods.  The  4-nsec-segment  correlation 
technique  will,  however,  give  us  a  convenient  means  of 
deriving,  and  studying  the  limitations  of,  the  complex 
correlation  function  procedure. 

We  denote  the  USB  data  on  each  tape  by  x^ (t j+of f set)  and 
x2 (t j+of f set) .  The  time  arguments  for  the  data  are  specified 
in  two  parts,  tj  is  the  epoch  (at  site  1)  of  the  center  of  the 
4  |isec  segment  to  which  a  particular  data  sample  is  assigned, 
and  the  offset  gives  the  epoch  of  the  data  sample  relative  to 
this  central  epoch.  These  time  arguments  are  given  in  dis¬ 
crete  form  because  we  have  sampled  the  signals  at  discrete 
times.  Using  the  techniques  we  discussed  in  Appendix  A,  we 
compute  the  cross  correlation  function  at  eight  lags  from  16 
samples  each  of  x1  and  x2.  The  value  of  the  cross  correlation 
function  for  each  of  the  lags,  At^,  evaluated  using  the  jth 
segment  of  data,  will  be  denoted  by  Rj(Axj).  This  cross 
function  coefficient  is  given  by 


R 


j 


(At.) 


8 

-k;_7Xl(V  kAts)x2(tj+ 


<'ap»j  +  kiV 


Ax . ) 


f 


(B. 2.1) 


where  the  At^,  the  lag  indices,  are  integral  multiples  of  the 


sampling  interval  Atg,  and  (x“p)j  is  the  epoch  of  the  bit 


nearest  to  the  a  priori  estimate  of  the  delay  between  the 


tapes  at  the  center  (k=0)  of  the  2th  segment  of  data.  When 


(t__) •  is  converted  to  a  offset  of  the  tapes  to  compensate  for 
ap  j 


this  delay,  the  offset  is  called  the  "bit  shift."  The  cross 
spectrum  can  now  be  computed  from  this  cross  correlation 
function.  The  cross  spectral  components  (wv)  from  the 


segment  will  be 


C .  (w  )  =  I  R^At.)  e_i“vATi  , 
J  i  =  -4  J 


(B.2.2) 


where  u>v  are  7  video  frequencies  spaced  at  intervals  of  0.25 
MHz  over  the  2  MHz  bandwidth  commencing  from  0.25  MHz.  We  now 
wish  to  sum  together  these  cross  spectral  components  from  each 
data  segment  accounting  for  the  changes  in  phase  due  to  the 
phase  delay  rate.  But  we  must  remember  the  cross  spectral 
components  which  we  have  computed  at  video  frequency  are  the 
same  spectral  components  which  were  originally  at  wv+<*>r,  where 
u>r  is  the  radio  frequency  (i.e.,  in  earlier  discussions)  of 
the  rth  channel  in  which  the  data  were  taken,  of  the  signal 
when  the  signal  arrived  at  the  radio  telescopes.  (We  derived 


the  expressions  which  relate  the  video  cross  spectrum  to  the 
RF  cross  spectrum  earlier  in  this  appendix  (see  Equations 
(B.1.6)).  In  those  derivations  we  needed  to  distinguish 
between  the  sum  of  the  LO  frequencies  at  the  two  radio  tele¬ 
scopes  (hence  the  notation  u>^  and  o^).  We  now  need  to  distin¬ 
guish  between  each  of  the  channels  and  hence  the  change  in 


notation.)  The  phases  of  these  cross  spectral  components  will 
be  changing  at  a  rate  given  by  the  product  of  their  original 
frequency  (u>v+u>r)  and  the  phase  delay  rate.  Hence  the  accumu¬ 
lated  cross  spectrum  will  be  given  by 


Ca  (w  ) 
qr '  v' 


n(q) 

Aci(V  8 


-i  (<D  )  (t  )  (t .-t  ) 

'  v  r#  1  ap'q'  3  q' 


(B.2.3) 


where  n(q)  is  the  number  of  data  segments  in  the  3th  accumu¬ 
lation  period,  t®  is  the  epoch  of  the  center  of  the  3th  accu¬ 
mulation  period  and  is  the  a  priori  phase  delay  rate 

ap  q  — - 

computed  at  the  center  of  the  acumulation  period.  We  should 
note  that  this  expression  is  only  valid  for  <*>v>0.  For  u>v<0 
the  sign  of  u>r  should  be  changed  (see  Equations  (B.1.4)).  The 
cross  spectral  components  for  «v<0  are  not  correctly  rotated. 
This  error  is  not  a  problem  because  the  delay  resolution 
function  only  requires  the  cross  spectral  components  with  “v>0 
for  USB  data  (see  discussion  associated  with  Equation 


(B. 1.6) ) . 


When  we  combine  all  of  the  previous  formulas  together  we 
obtain 


<“v> 


n  (q) 
Z 

j-1 


3  8 

iL4  kL7«i‘VMt.,*2<tj+(tip>i+Mv»8i> 

expt-ilw^Tj  +  (wv+wr)  (*ap)q(tj-tg) )}  . 

(B. 2.4) 


If  we  rearrange  the  terms  in  the  above  equation  and  substitute 
u'*ur+xB  ( oj *  is  the  radio  frequency  corresponding  to  the 


center  of  the  video  band) ,  we  obtain 


<u>v) 


3  n(q)  8  . 

L  Z  Z  {x,(t.  +  kAt s)x2(t.+  (x°  )  ,+kAt  -At.) 
i  =  -4  j»l  k=-7  1:5  s  ^  3  aP  3  si 

exp[-iw*  {^ap)q(tj-tq)  ]  }  expt-iu^Ax.] 
exp[-i(a,v-7iB)  (^ap)q(tj-tq)  1  .  (B.2.5) 


The  last  term  in  the  above  expression  represents  the  differ¬ 
ential  rotation  of  the  cross  spectral  components  at  ojy  rela¬ 
tive  to  the  component  at  the  center  of  the  video  band,  due  to 
the  phase  delay  rate.  If  we  neglect  this  term  we  see  that  the 
video  frequency  only  appears  in  one  other  place  (the  exp 
(-iuvAxi)  term).  The  exp  (-it^A-rp  term  is  independent  of  j. 
and  k  and  hence  we  can  perform  the  summation  over  j  and  k 
without  specifying  which  wv  will  be  used  (we  should  note  that 
u>r  does  not  depend  on  the  choice  of  uy  either)  .  (If  the  last 
term  is  included  in  the  summation,  then  u>v  must  be  specified 
in  order  to  carry  out  the  summation  over  j) .  Also,  if  we 
re-examine  the  summations  over  and  k»  we  see  there  is  no 
reason  to  keep  them  as  separate  summations  when  the  last  term 
in  Equation  (B.2.5)  is  neglected.  So  we  will  combine  these 
two  summations  together.  To  combine  the  summations  we  need  to 
change  the  way  we  specify  the  epoch  of  a  bit  as  a  function  of 
the  summation  indices.  We  will  introduce  a  new  summation 
index  which  is  related  to  j  and  k  by  p  *  16j  +  k  (remember 
that  the  j.  index  steps  through  the  segments,  each  of  which  is 
16  bits  long) .  The  epoch  of  a  bit  at  Site  1  will  now  be  given 
by  t^+pAt  ,  where  t*  is  the  epoch  of  the  first  bit  (at  Site 


1) ,  in  the  accumulation  period  (i.e,  tq+pAtg  »  tj+kAtg  when  p 
=  16j+k) .  With  the  introduction  of  the  summation  index  £, 
Equation  (B.2.5)  becomes  (when  we  neglect  the  exp(-i (o>v-itB) 


<*ap>q  term)' 

_  3  N(q)  l  h 

Cqr(V  *  .f_4  X1<V  pAts)x2(tq+  <TapV  pAV  AV 


ex 


pt-i"’  (tap)q(tp"tq)  1  *  *■  (B.2.6) 


where  N(q)  *  16n(q)  and  equals  the  number  of  bits  in  the  £th 
accumulation  period.  The  inner  summation  is  called  the  com¬ 
plex  cross  correlation  function,  R°(Atj),  which  is  evaluated 
at  8  complex  lags  (the  complex  lag  is  the  combination  of  the 
lag  Ax.  and  the  rotation  expl-iw’ (t  )  (t  -tc)],  i.e. 

i  ap  q  P  q 

Ru(iV  *  ^>xl<tq+  p“s,,2(tq*  <TapV  pAV  AV 


exp[-iw'(tap)q(tp-tq)  ]  . 


(B. 2.7) 


The  cross  spectrum  is  obtained  by  computing  the  Fourier 
transform  of  the  complex  cross  correlation  function.  The 
Fourier  transform  is  approximated  by  the  summation  over  i_  in 
Equation  (B.2.6).  The  use  of  4  nsec  segments  of  data  is  now 
no  longer  necessary;  all  the  data  in  an  accumulation  period  is 
summed  directly  into  the  complex  cross  correlation  function. 


B.3  "FRNGE"  algorithms 

Now  that  we  have  derived  the  correlator  algorithms,  we 


are  ready  to  develop  the  algorithms  used  by  FRNGE.  The 


correlator  only  computes  the  complex  cross  correlation  func¬ 
tion.  The  components  of  the  cross  spectrum  are  computed  in 
FRNGE.  Once  these  components  have  been  computed,  they  can  be 
used  to  evaluate  the  delay  resolution  function  for  trial 
values  group  delay  and  phase  delay  rate. 

The  delay  resolution  function  was  given  in  Appendix  A 
(Equation  (A.10)).  When  the  cross  spectral  components  from 
all  accumulation  periods  are  used,  the  delay  resolution 
function  becomes 


Q  R  7  a  -  c 

*  .S  CqrKj)  exPl-i<Vwvj-wo)6Vi(a,r+“vj)6tp(Vtc)1 
q=l  r=l  ]=-7  M  J  J  M 
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where  and  6td  are,  respectively,  the  corrections  to  the 


group  delay  and  the  phase  delay  rate  at  the  epoch  tc  of  the 


observation  (usually  tc  is  near  the  center  of  the 


observation) .  The  summation  over  3  represents  a  summation 
over  time,  Q  being  the  number  of  accumulation  periods;  the  £ 
summation  index  runs  over  the  number  of  frequency  channels 
(typically  <8  for  geodetic  observations)  ;  and  the  j  index  runs 
over  the  discrete  video  frequencies  for  upper  and  lower 
sidebands  (in  earlier  equations  we  did  not  need  to  specify 
discrete  video  frequencies  and  hence  we  did  not  use  a 
subscript)  .  (The  summation  over  2  excludes  the  video  DC  term  ; 
=  0.)  (In  Equation  (A. 10)  these  three  summations  were 
expressed  as  a  single  summation  (see  comments  made  at  that 
time) .)  If  only  one  sideband  is  available  the  summation  need 
only  be  evaluated  for  the  video  frequencies  which  are 
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available. 


If  we  had  an  infinitely  fast  computer ,  or  were  planning 
to  make  only  a  few  Mark  III  observations,  we  could  directly 
sum  Equation  (B.3.1)  using  a  series  of  trial  values  for  6xg 
and  6Xp  until  we  found  the  maximum  value  of  Equation  (B.3.1). 
(In  a  typical  observation  «10®  values  for  6xg  and  6xp  are 
tried.)  Unfortunately,  computers  work  at  finite  rates  and  the 
number  of  Mark  III  observations  is  not  small  (between  July, 
1980,  and  January,  1982,  over  60,000  Mark  III  observations 
were  made  in  geodetic  experiments) .  Consequently  the  actual 


calculation  of  Equation  (B.3.1)  is  not  by  direct  summation.  A 
large  saving  in  computation  time  can  be  made  by  adopting  two 
group  delay  estimates  rather  than  the  single  one  (6xg)  shown 
in  Equation  (B.*>.1).  These  two  delays  are  called  the  multi¬ 
band  group  del;  and  the  singleband  group  delay.  The  multi¬ 
band  group  delay  is  determined  from  the  change  of  phase 
between  the  widely  spaced  frequencies,  the  singleband  group 
delay  is  determined  from  the  change  of  phase  across  the  2  MHz 
recorded  bandwidth.  (There  are  other  distinctions  between 
these  delay  types  which  will  be  discussed  later.) 

We  may  incorporate  the  singleband  delay  in  Equation 
(B.3.1)  by  writing  this  equation  as 


Q  R  7  _ 

*  E ,  E  ,Cqr(wvj)  expl-i(u  -u  )6t 
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where  6xb  is  called  the  single  band  delay.  We  don't  seem  to 

have  saved  any  computer  time  by  introducing  the  singleband 

delay.  Actually,  the  maximization  of  Equation  (B.3.2)  would 

take  longer  than  the  maximization  of  Equation  (B.3.1),  because 

in  the  former  case  we  need  to  search  over  three  quantities 

(6xg,  6tp  and  6xfe) .  The  improvement  in  the  computation  time 

comes  about  by  using  an  efficient  method  to  search  over  the 

values  of  6?  ,  6$  and  61.  . 

g  p  b 

The  values  of  6Xg,  6xp  and  6xb  which  maximize  Equation 
(B.3.2)  will  be  the  maximum  likelihood  estimates  (within  the 
limits  of  the  approximations  made  during  the  derivation;  see 
Section  B.4) ,  of  the  multiband  group  delay,  singleband  group 
delay  and  phase  delay  rate.  There  are  a  number  of  techniques 
we  could  use  to  perform  the  maximization,  e.g.  "brute  force" 
searching,  using  successively  finer  spacings  of  the  trial 
values  until  a  maximum  is  found,  or  we  could  perform  a 
"coarse"  search  and  use  the  least  squares  estimator  with  the 
coarse  search  estimates  as  the  a  priori  values  (see  Appendix 
A).  For  the  remainder  of  this  appendix,  we  will  discuss  one 
particular  implementation  of  the  maximization  of  Equation 
(B.3.2),  that  carried  out  in  the  Haystack  Observatory's 
program  "FRNGE."  This  program  has  been  used  to  process  all  of 
the  data  which  is  analysed  in  Chapters  3  through  8.  The 
program  "FRNGE",  being  software,  is  subject  to  change 
(especially  if  its  algorithms  are  found  to  deficient).  The 
algorithms  which  we  will  discuss  are  those  used  at  this  time 
(May,  1983)  but  they  may  change  in  the  future  (see,  e.g.,  the 


discussions  in  Section  B.4  on  the  neglect  of  video  rotation)  , 
making  obsolete  some  of  the  discussion  which  follows. 

The  efficent  search  methods  are  developed  by  carrying  out 
the  summations  in  Equation  (B.3.2)  in  stages.  Firstly,  the 
summation  over  j  is  performed  to  generate  the  "singleband" 
delay  function,  Dqr(6-rb),  (the  remaining  summations  are 
performed  later) , 

7 

Dqr(6tb’  ■  ?_7Cqr<“vj>  6tb>  (B.3.3) 

Equation  (B.3.3)  is  evaluated  at  16  trial  values  of  the  cor¬ 
rection,  6t£,  to  the  a  priori  delay,  spaced  at  half  the  sam¬ 
pling  interval  and  centered  on  zero.  Before  proceeding,  let 
us  consider  the  meaning  of  Equation  (B.3.3).  If  we  were  to 
maximize  the  magnitude  of  Dqr(6ffc)  by  searching  over  many 
values  of  what  would  be  the  results  (assuming  that  6x^  is 

no  longer  restricted  to  the  16  discrete  values) ?  For  dual 
sideband  data,  the  value  of  which  maximizes  the  magnitude 
of  Dqr(6Tb^'  would  be  a  least  squares  estimate  of  the  slope  of 
the  cross  spectral  phases  across  the  combined  4  MHz  channel 
pair.  For  single  sideband  data,  the  slope  is  across  the  2  MHz 
channel;  see  Appendix  A.  The  phase  of  Dqr^Tjj),  where  6Tb  is 
the  value  of  6xb  which  maximizes  the  magnitude  of  Dgr (6xb) » 
would  be  the  estimate  of  the  intercept  of  the  sloped  line  at 
“vj  =  0,  the  DC  edge  of  the  video  band,  which  corresponds  to 
the  radio  frequency  «r  (see  Section  B.l) .  We  will  refer  to 
this  intercept  as  the  "video  DC  phase".  Consequently, 
maximizing  the  singleband  delay  function  can  be  viewed  as 


estimating,  from  the  14  cross  spectral  phases  for  dual 
sideband  data,  a  slope  across  the  recorded  bandwidth  (the 
singleband  delay)  and  a  phase  at  video  DC  (the  video  DC 
phase) .  If  single  sideband  data  are  used,  only  seven  cross 
spectral  phases  are  used  to  estimate  the  slope  and  the  inter¬ 
cept.  These  concepts  of  the  nature  of  the  singleband  delay 
function  will  be  important  later  on. 

The  uniform  spacing  of  the  values  allows  Equation 
(B.3.3)  to  be  computed  using  a  32  point  fast  Fourier  transform 
( FFT)  routine. 

After  the  computation  of  the  singleband  delay  function, 
the  remaining  summations  in  Equation  (B.3.2)  are  carried  out 
for  many  trial  values  of  the  phase  delay  rate  and  multiband 
group  delay  .  These  summations  are  again  carried  out  using 
FFT  routines.  To  facilitate  using  the  FFT  routines  the  phase 
delay  rate  summation  is  computed  for  trial  values  of  the 
"fringe  rate"  rather  than  for  trail  values  of  the  phase  delay 
rate.  (The  "fringe  rate"  is  defined  to  be  <*>r^p»  and  repre¬ 
sents  the  rate  of  change  of  phase  with  time,  in  contrast  to 
the  rate  of  change  of  delay  with  time,  which  is  the  phase 
delay  rate.  The  fringe  rate  is  used  in  doing  the  FFT's 
because  standard  routines  (contained  in  firmware)  which  use 
the  standard  transformation  between  the  "time"  and  "frequency" 
domains  can  be  directly  used.)  The  FFT's  constructed  using 
the  fringe  rates  and  the  multiband  delays  are  called  the 
"fringe  rate  spectrum"  and  the  "multiband  delay  function." 

The  maximum  value  of  the  multiband  delay  function  will 
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occur  at  the  trial  values  of  the  multiband  group  delay,  the 
singleband  group  delay  and  the  fringe  rate  which  are  closest 
to  the  values  of  these  quantities  which  maximize  the  delay 
resolution  function.  The  FFT's  used  to  compute  the  multiband 
delay  function  can  be  computed  very  rapidly  which  allows  the 
evaluation  of  the  multiband  delay  function  at  a  large  number 
of  trial  values  of  Xg,^  and  ip*  (Usually  between  128  and  256 
values  are  used  for  x„  and  x  .  16  values  of  x.  are  used  as 
mentioned  earlier  (x^  will  not  be  determined  very  accurately 
and  hence  does  not  require  a  fine  spacing  (see  earlier  discus¬ 
sions  on  the  interpretation  of  the  singleband  delay)).)  To 
determine  the  final  values  of  and  f  an  iterative  pro¬ 

cedure  is  used  in  which  the  delay  resolution  function  is 
summed  directly  using  finer  spaced  values  of  xg  and  xp  which 
are  centered  on  those  values,  found  from  the  FFT's  in  the 
first  iteration,  or  from  the  previous  iteration  in  subsequent 
iterations.  (These  summations  will  only  be  carried  out  a  rela¬ 
tively  small  number  of  times  and  hence  the  efficiency  of  the 
FFT's  is  not  needed.) 

The  delay  resolution  function,  R  (6x^,5-?^, 6?^)  ,  as 
implemented  in  FRNGE,  is  constructed  from  the  singleband  delay 
functions  (rather  than  from  the  cross  spectral  components)  by 
direct  summation  of  these  functions: 


R(6*  ,6t  ,6*.  ) 
g  p  b 
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where  is  the  phase  calibration  phase  for  the  jrth  channel 

(these  phases  are  discussed  in  detail  in  Section  2.3);  is 

the  value  of  6x£  which  maximized  the  multiband  delay  function 
(remember  that  were  16  discrete  values  used  to  construct 
the  single  band  delay  function) .  The  final  term, 

(-inBtiqr  ( 6*-6t£)  )  ,  in  Equation  (B.3.4)  we  will  examine  in  some 
detail.  The  variable  riqr  is  given  by 

V  =  INX(«3) -Nu(q)  3  /[NA(q)  +NU ( q)  ]  (B.3.5) 

where  Nu(q)  and  N^tq)  are  the  number  of  bits  correlated  in  the 
upper  and  the  lower  sidebands,  respectively,  in  the  3th 
accumulation  period.  (Prior  to  this  time  we  had  not  needed  to 
distinguish,  in  the  summations,  between  upper  and  lower  side¬ 
bands  and  hence  we  have  been  using  one  symbol  to  denote  the 
total  number  of  bits  correlated  in  both  sidebands,  N (q) ,  (N(q) 
=  Nu(q)+NA(q);  see  Table  B.2.1).)  If  we  examine  Equation 
(B.3.5)  we  see  that  if  only  USB  data  are  available  Tlqr!S_l» 
i.e.  NJl(q)=0;  if  only  LSB  data  is  available, 

Nu(q)  *  0.  We  first  consider  these  two  limiting  cases.  In 
Figure  B.3.1  we  show  graphically  the  purpose  of  this  last 
term.  In  the  upper  plot  we  show  the  cross  spectral  phases 
from  USB  data  in  two  frequency  channels  (we  have  chosen  just 
two  to  simplify  the  figure).  Because  the  singleband  delay 
index  is  discrete,  the  value  of  which  maximizes 

D(6?g, 6x^,6^)  will  not  necessarily  correspond  to  one  of  the 
discrete  indices.  The  singleband  delay  is  obtained  from  the 
least  squares  fit  of  a  group  delay  and  a  phase  at  u>r  (video 


DC)  to  the  cross  spectral  phases  in  the  2  MHz  bandwidth 
channels  (see  discussion  on  Equation  B.3.3).  Because  the 
singleband  delay  function  (Equation  (B.3.3))  is  evaluated  at 
discrete,  equally  spaced,  values  of  (rather  than  at  the 

value  of  which  will  maximize  the  delay  resolution)  ,  the 
phase  at  u>r  will  be  in  error  by  -itB (St^-St^)  when  only  USB 
data  used  (also  see  discussion  below) .  The  final  term  in 
Equation  (B.3.4)  corrects  for  the  phase  error  at  video  DC. 

Figure  B.3.1.b  shows  the  effects  of  the  discrete  single¬ 
band  delay  on  LSB  data.  From  this  figure  we  clearly  see  the 
same  error  as  in  the  USB  case  except  that  the  sign  of  the 
error  has  changed.  Figures  B.3.1.a  and  b  should  indicate  why 
the  term  involving  T)gr  is  included  in  Equation  (B.3.4),  but 
how  do  we  correct  the  immediate  cases  when  some  data  are 
available  in  both  sidebands?  we  need  to  calculate  the  video 
DC  phase  error  (see  discussion  on  Equation  (B.3.3)  which  will 
result,  when  we  perform  a  least  squares  fit  of  a  slope  and 
intercept  to  unequally  weighted  data,  when  the  slope  is  total¬ 
ly  constrained  to  a  specific  value  (see  Appendix  E  for  a 
discussion  on  constrained  least  squares  estimation) .  The 

value  of  will  then  be  calculated  to  correct  for  this 
'qr 

error.  (The  values  of  the  slope  are  constrained  because  the 
single  band  delay  function  has  only  be  computed  at  discrete 
values  of  6x£.)  This  estimation  problem  is  simplified  by  the 
weights,  given  to  each  cross  spectral  phase,  being  equal 
within  a  sidband.  (This  is  not  quite  true  because  the 
response  of  the  bandpass  filters,  in  the  video  converters,  is 


not  constant  across  the  2  MHz  bandwidth  of  each  sideband.  See 
Section  B.4  for  more  details.)  Since  the  weights,  within  a 
sideband  are  approximately  equal,  we  may  replace  the  seven 
cross  spectral  phases,  from  each  sideband,  with  their  mean 
value  at  their  central  frequency  (1  MHz  for  USB  and  -1  MHz  for 
LSB) .  Consequently,  the  phase  error  at  video  DC,  when  the  two 
sidebands  are  combined,  will  be  the  weighted  mean  of  these 
central  frequency  phases.  The  difference,  A$w,  between  the 
weighted  mean  and  the  algebraic  mean,  will  be  given  by 

A4w  =  f  lwA/<wu+wx)  J  (-*B)  +  Iwu/(wu+wA)  )  (uB)  ]  (6Tb-6^) 

(B.3.6) 

where  wy  and  w^  are  the  weights  given  to  the  phases  from  the 
upper  and  the  lower  sidebands.  In  Appendix  A,  we  saw  that  the 
weights  to  be  used  during  the  least  squares  solution  are 
proportional  to  the  product  of  the  number  of  bits  correlated 
and  the  product  of  the  antenna  temperatures  divided  by  the 
system  temperatures  a.t  the  two  sites  in  the  interferometer 
(see  Equation  (A.19)).  The  latter  term  in  the  weight  (i.e., 
the  product  of  the  antenna  temperatures  divided  by  the  system 
temperatures) ,  will  probably  remain  constant  during  an  obser¬ 
vation  (typically  100  sec)  and  hence  the  major  variations  in 
the  weights  assigned  to  each  sideband  will  be  due  to  changes 
in  the  number  of  bits  correlated.  Hence  if  we  replace  wu  and 
w^  with  Ny(q)  and  N^(q) ,  in  Equation  (B.3.6),  we  obtain  the 
expression  containing  r)qr  in  Equation  (B.3.4).  The  possible 
errors  which  are  introduced  into  the  estimates  of  the  multi- 


DC)  to  the  cross  spectral  phases  in  the  2  MHz  bandwidth 
channels  (see  discussion  on  Equation  B.3.3).  Because  the 
singleband  delay  function  (Equation  (B.3.3))  is  evaluated  at 
discrete,  equally  spaced,  values  of  (rather  than  at  the 

value  of  6?^  which  will  maximize  the  delay  resolution),  the 
phase  at  u>r  will  be  in  error  by  -xB  ( 6t^-6t^)  when  only  USB 
data  used  (also  see  discussion  below) .  The  final  term  in 
Equation  (B.3.4)  corrects  for  the  phase  error  at  video  DC. 

Figure  B.3.1.b  shows  the  effects  of  the  discrete  single¬ 
band  delay  on  LSB  data.  From  this  figure  we  clearly  see  the 
same  error  as  in  the  USB  case  except  that  the  sign  of  the 
error  has  changed.  Figures  B.3.1.a  and  b  should  indicate  why 
the  term  involving  is  included  in  Equation  (B.3.4),  but 

how  do  we  correct  the  immediate  cases  when  some  data  are 
available  in  both  sidebands?  We  need  to  calculate  the  video 
DC  phase  error  (see  discussion  on  Equation  (B.3.3)  which  will 
result,  when  we  perform  a  least  squares  fit  of  a  slope  and 
intercept  to  unequally  weighted  data,  when  the  slope  is  total¬ 
ly  constrained  to  a  specific  value  (see  Appendix  E  for  a 
discussion  on  constrained  least  squares  estimation) .  The 
value  of  r)qr  will  then  be  calculated  to  correct  for  this 
error.  (The  values  of  the  slope  are  constrained  because  the 
single  band  delay  function  has  only  be  computed  at  discrete 
values  of  6t£»)  This  estimation  problem  is  simplified  by  the 
weights,  given  to  each  cross  spectral  phase,  being  equal 
within  a  sidband.  (This  is  not  quite  true  because  the 
response  of  the  bandpass  filters,  in  the  video  converters,  is 
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not  constant  across  the  2  MHz  bandwidth  of  each  sideband.  See 
Section  B.4  for  more  details.)  Since  the  weights,  within  a 
sideband  are  approximately  equal,  we  may  replace  the  seven 
cross  spectral  phases,  from  each  sideband,  with  their  mean 
value  at  their  central  frequency  (1  MHz  for  USB  and  -1  MHz  for 
LSB) .  Consequently,  the  phase  error  at  video  DC,  when  the  two 
sidebands  are  combined,  will  be  the  weighted  mean  of  these 
central  frequency  phases.  The  difference,  A$w,  between  the 
weighted  mean  and  the  algebraic  mean,  will  be  given  by 

A0W  =  [  [wA/(wu+wA)  ]  (-*B)  +  [wu/(wu+Wjl)  ]  (*B)  ]  (6Tb-6^) 

(B.3.6) 

where  wy  and  w^  are  the  weights  given  to  the  phases  from  the 
upper  and  the  lower  sidebands.  In  Appendix  A,  we  saw  that  the 
weights  to  be  used  during  the  least  squares  solution  are 
proportional  to  the  product  of  the  number  of  bits  correlated 
and  the  product  of  the  antenna  temperatures  divided  by  the 
system  temperatures  at  the  two  sites  in  the  interferometer 
(see  Equation  (A.19)).  The  latter  term  in  the  weight  (i.e., 
the  product  of  the  antenna  temperatures  divided  by  the  system 
temperatures) ,  will  probably  remain  constant  during  an  obser¬ 
vation  (typically  100  sec)  and  hence  the  major  variations  in 
the  weights  assigned  to  each  sideband  will  be  due  to  changes 
in  the  number  of  bits  correlated.  Hence  if  we  replace  wu  and 
w^  with  Ny(q)  and  ( q) ,  in  Equation  (B.3.6),  we  obtain  the 
expression  containing  tigr  in  Equation  (B.3.4).  The  possible 
errors  which  are  introduced  into  the  estimates  of  the  multi- 


band  group  delay  by  the  use  of  the  discrete  singleband  delay 
are  discussed  further  in  Section  3.1. 


The  need  for  this  correction  to  the  phase  at  i*>r  could  be 
eliminated  if  the  singleband  delay  functions  were  recalculated 
from  the  cross-spectral  components  at  6*b.  This  approach 
would  however  lead  to  considerably  more  computations.  If 
these  additional  computations  were  carried  out  (and  conse¬ 


quently  tiqr;  set  to  zero  in  all  cases)  ,  Equation  (B.3.4)  would 


be  the  same  as  Equation  (B.3.2),  with  the  addition  of  the 
phase  calibration  phases.  (There  is  one  term  neglected  in 
going  from  Equation  (B.3.2)  to  Equation  (B.3.4):  the  term 


-iu>vj6fp(tq-tc)  .  We  will  discuss  the  effect  of  this  approxi¬ 


mation  when  we  are  investigating  a  similar  term  which  was 
neglected  in  deriving  Equation  (B.2.6),  namely 


-i«vjWtq,-) 


A  parabola  is  fitted  through  the  magnitude  of  the  delay 
resolution  function  evaluated  at  three  points  centered  on  and 
equally  spaced  about  the  estimates  for  the  multiband  group 
delay,  singleband  group  delay  and  phase  delay  rate  from  either 
the  FFT's  or  the  previous  iteration.  Such  a  parabola  is 
determined  separately  for  each  observable,  i.e.  the  parabola 
is  fitted  in  only  one  dimension.  The  phase  delay  rate  is 
fitted  first,  followed  by  the  multiband  group  delay  and  then 
the  singleband  delay  is  estimated.  The  fitting  process  is 
repeated  twice  more  using  the  new  estimates  of  the  multiband 
delay,  phase  delay  rate  and  the  singleband  delay.  The  final 
iteration  is  carried  out  with  half  the  spacing  of  the  trial 
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values  for  the  multiband  delay  and  phase  delay  rate.  The 
visibility  phase  is  then  obtained  from  the  phase  of  the  delay 
resolution  function  evaluated  at  the  final  estimates  of  the 
multiband  delay,  single  band  delay  and  phase  delay  rate. 

Is  this  search  technique  assured  of  converging  to  the 

actual  maximum  value  of  the  delay  resolution?  If  it  were 

iterated  an  "infinite”  number  of  times,  using  successively 

finer  spacings  for  the  one  dimensional  parabolic  fits  -  yes. 

To  see  why,  for  the  delay  resolution  function,  convergence  is 

assured  (it  isn't  assured  for  all  functions),  we  need  to 

consider  several  properties  of  the  delay  resolution  function 

and  the  initial  search  algorithms.  We  stated  earlier  that  the 

initial  search  over  many  trial  values  is  carried  out  using 

FFT's.  We  may  use  several  properties  of  Fourier  transforms  to 

deduce  the  properties  of  the  delay  resolution  function.  The 

delay  resolution  function  is  "bandlimited"  (see,  e.g., 

Bracewell,  1978,  for  a  discussion  on  properties  of  discrete 

Fourier  transforms) ,  in  both  the  delay  and  rate  domains 

because  only  a  finite  bandwidth  is  spanned  (usually  <400  MHz) , 

and  observations  are  of  a  finite  duration  (usually  between  100 

and  400  sec) .  Because  the  spanned  bandwidth  and  observation 

duration  are  finite  there  are  finite  maximum  frequencies  which 

can  exist  in  the  delay  resolution  function.  (We  are  using 

frequency  here  as  a  general  term  to  indicate  variations  of  the 

magnitude  of  the  delay  resolution  function  with  changes  of 

and  6-i  ,  rather  than  in  its  usual  sense  of  variations  with 
P 

time.)  The  highest  frequency  variations  which  can  occur  with 


changes  in  and  6-ip  are  the  inverse  of  spanned  bandwidth 

and  the  inverse  of  the  duration  of  the  observations, 

respectively.  The  initial  trial  values  6fg  and  6ip  are 

separated  at  most  by  a  quarter  and  a  half  of  the  inverses  of 

these  highest  frequency  variations,  respectively.  (The 

numbers  of  trial  values  of  and  Mp  are  selected  to  be  the 

smallest  powers  of  two  (to  insure  efficient  operation  of  the 

FFT's  (Bracewell,  1978,  Chapter  18)),  which  will  sample  the 

6tg,  6t  space  with  at  least  these  frequencies.)  For  a  100 

sec  observation  with  a  spanned  frequency  range  of  400  MHz,  the 

initial  spacings  of  6?  and  will  be  at  most  0.6  nsec  and 

g  p 

0.6  psec/sec  (at  X-band) .  Because  the  fringe  rate  spectrum 

and  multiband  delay  function  are  oversampled,  we  are  assured 

that  the  initial  search  values,  which  yield  the  maximum  value 

of  the  multiband  delay  function,  are  within  half  of  the  search 

value  separation,  of  the  peak  of  the  delay  resolution 

function,  i.e.  we  know  the  peak  cannot  be  located  nearer  to 

any  other  values  because  this  would  require  higher  frequencies 

in  the  delay  resolution  function  than  can  exist. 

Once  these  initial  values  are  found,  the  maximization  is 

carried  out  with  one-dimensional  fits,  and  re-computation  of 

Equation  (B.3.4)  at  the  fitted  values,  repeated  three  times. 

The  peak  of  the  delay  resolution  function  will  be  reached  when 

dR/8Tg=0  and  aR/8Tp»0  (these  conditions  are  satisfied  by  the 

above  technique) ,  provided  a2 R/a2t  +82R/82t  >  a2R/(8t  at  ) , 

y  r  y  f 

(Hildebrand,  1976,  pp.  356-357).  This  latter  condition  is 
satisfied  by  the  delay  resolution  function  and  may  be 


confirmed  by  direct  differentiation  of  the  function. 
Consequently,  the  converged  values  of  the  one  dimensional  fits 
will  be  at  the  maximum  of  the  delay  resolution  function. 

When  we  introduced  the  singleband  delay,  the  reason 
stated  was  for  computational  efficiency.  But,  there  is  a 
second  reason  for  the  introduction  of  this  delay.  Consider 
the  effects,  on  the  cross  spectral  phases,  of  adding  a  piece 
of  ( ron-dispersi ve)  cable  (or  any  delay  introducing  device) , 
between  the  first  LO  and  the  IP  distributor  (see  Figure 
2.1.2).  The  arrival  time  of  the  signal  at  the  sampler  (see 
Section  2.1)  would  be  later  than  in  the  case  where  the  addi¬ 
tional  cable  was  not  present.  All  of  the  cross  spectral 
phases  would  change  by  an  amount  given  by  the  product  of  their 
intermediate  frequency  (because  the  cable  was  inserted  between 
the  first  LO  and  the  IF  distributor)  and  the  delay  through  the 
cable.  In  addition,  the  phase  calibration  phases  would  also 
change  by  the  product  of  their  IF  frequency  by  the  delay. 

When  we  introduced  the  singleband  delay  function  (Equation 
(B.3.3)),  we  discussed  that  maximization  of  this  function 
essentially  allows  us  to  replace  the  fourteen  or  seven 
(depending  on  whether  dual  or  single  sideband  data  are  used)  , 
cross  spectral  phases  in  each  recorded  channel  with  a  slope 
(the  singleband  delay)  and  an  offset  (the  phase  at  video  DC) . 
Clearly,  this  slope  will  be  affected  by  the  additional  cable 
and  hence  the  singleband  delay  estimate  will  change.  However, 
the  change  in  phase  at  video  DC  will  be  the  same  as  the  change 
in  the  phase  calibration  phase  at  the  correponding  RF  and 


hence  when  the  phase  calibration  is  applied  (see 
Equation(B. 3. 4) ) ,  the  calibrated  phases  (at  video  DC)  will  be 
the  same  as  these  phases  before  the  cable  was  inserted.  (This 
"equality"  is  not  exact  because  the  phase  calibration  signals 
will  have  a  frequency  10  kHz  higher  than  that  of  the  video  DC. 
Provided  the  delay  introduced  by  the  cable  is  less  than  277 
nsec,  the  difference  between  the  video  DC  phase  and  the  phase 
calibration  phase  will  be  less  than  1  degree.)  Since  the 
multiband  delay  is  estimated  from  the  changes  in  the  video  DC 
phases  between  channels,  the  multiband  group  delay  will  not  be 
affected  by  this  additional  cable. 

We  can  now  extend  these  arguments  to  show  that  all  delays 
in  the  receiver  and  cables,  after  the  injection  of  the  phase 
calibration  pulses,  are  calibrated  by  the  phase  calibration 
system  (if  this  system  is  functioning  correctly;  see  Section 
3.2)  and  the  multiband  delay  will  be  unaffected  by  these 
delays.  The  multiband  delay  measures  the  difference  in 
arrival  times  of  the  signals  from  the  radio  source  at  the 
injection  point  of  the  phase  calibration  pulses.  On  the  other 
hand,  the  singleband  delay  measures  all  of  the  delays  through 
the  receiving  system.  The  singleband  delay  measures  the 
difference  in  arrival  times  of  the  signals  at  the  samplers. 
Consequently,  the  singleband  and  multiband  delays  measure  the 
difference  in  arrival  times  at  different  locations.  If  the 
delay  through  the  receiver  remains  constant  then  there  should 
be  just  a  constant  offset  between  the  singleband  and  multiband 
delays . 
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The  location  in  the  receiver  to  which  the  singleband  and 
multiband  delays  are  referred  could  be  made  the  same, 
assumming  that  the  contribution  of  the  receiver  electronics  to 
the  cross- spectral  phases  varied  linearly  with  frequency  over 
the  2  MHz  recorded  bandwith,  by  extracting  the  phase  of  the 
calibration  rail  at  the  mid  video  frequency  (i.e.  1.010  MHz). 
This  technique  has  not  been  adopted  in  the  design  of  the  Mark 
III  'see  Section  2.3  for  discussion). 

We  should  also  note  that  the  group  delay  will  be  deter¬ 
mined  predominantly  from  the  changes  in  the  phase  between 
channels  (which  will  span  100-400  MHz) .  The  additional 
information  about  the  group  delay  which  would  be  supplied  by 
the  change  in  phase  across  only  2  MHz  will  only  have  a  small 
effect  on  the  knowledge  of  the  group  delay  from  the  channel  to 
channel  variation  in  phase. 


B.4  Analysis  of  the  effects  of  the  approximations  made  in 


resolution  function 


rotations 


3)  The  neglect  of  the  rotation  across  the  video  band 
while  accumulating  the  complex  cross  correlation 
function  and  while  summing  the  delay  resoution 
function. 

4)  The  accumulation  of  the  cross  correlation  function  for 
the  duration  of  the  accumulation  period  (up  to  2 
seconds)  using  an  a  priori  phase  delay  rate  which  may 
not  be  correct. 

These  approximations  will  be  analyzed  in  the  order 
listed.  We  will  be  concerned  with  their  effects  on  the  multi¬ 
band  group  delay,  the  phase  delay  and  the  phase  delay  rate. 

In  Figure  B.1.1  we  saw  that  the  cross  correlation  func¬ 
tion  decayed  rapidly  as  the  offset  betweeen  the  tapes  was 
moved  away  from  the  actual  delay  at  the  epoch  of  recording. 

But  is  eight  lags  (0.25  nsec  apart)  enough  to  calculate  ade¬ 
quately  the  cross  spectrum  from  the  cross  correlation  func¬ 
tion?  For  the  initial  analysis  of  this  problem  we  will  assume 
that  the  power  spectral  density  of  the  video  signals  are 
independent  of  frequency  over  the  2  MHz  bandwidth  of  each 
channel.  For  this  case  we  may  calcuate  the  cross  correlation 
in  analytic  form  and  compute  the  cross  spectrum  from  the  cross 
correlation  function  at  the  eight  lags  computed  by  the 
correlator.  We  may  then  directly  compare  these  estimates  of 
the  cross  spectrum  components  with  the  actual  cross  spectrum 
to  evaluate  the  errors  introduced  by  using  only  eight  lags. 

This  problem  was  analyzed  by  Whitney  (1974)  ,  but  he 


considered  the  errors  introduced  by  using  seven  lags  (the 
number  of  lags  used  in  the  Mark  I  correlator) .  For  his 
analysis  he  also  assumed  that  the  actual  group  and  phase 
delays  were  zero.  This  assumption  is  not  realistic  because 
even  if  the  delay  were  zero  at  one  epoch  in  an  observation  it 
would  be  nonzero  at  other  times  because  of  the  (nonzero)  phase 
delay  rate.  For  a  phase  delay  rate  of  2  nsec/sec,  the  delay 
would  change  by  200  nsec  during  a  100  second  observation. 

The  cross  correlation  function  of  the  USB  video  signals 
is  given  by  Equation  (B.1.7).  For  the  analysis  of  the  effects 
of  using  only  eight  lags  to  compute  the  cross  spectrum  we  will 
simplify  the  form  of  Equation  (B.1.7)  by  assuming  that  xg  =  Tp 
and  A <J> u  ( t)  =  0.  We  will  also  replace  by  wr  (the  RF  of  the 
£th  channel  from  which  the  data  were  obtained) ,  which  is 
consistent  with  the  notation  of  Equation  (B.3.2),  (see  also 
Table  B.2.1).  (We  will  investigate  the  effects  of  relaxing 
these  assumptions  after  we  have  carried  out  an  initial 
analysis  of  the  problem.)  In  its  simplified  form.  Equation 
(B.1.7)  becomes 


r“2(x)  =  cos(u>rxg(t)  +  wB(t  (t)-T)) 


sin(nB(x  (t)-x) ) 


*B(t  (t) -t) 


(B.4.1) 


where  xg  ( t)  represents  the  delay  at  time  t^,  i.e.  we  have 
incorporated  the  phase  delay  rate  term  into  the  delay  xg(t). 
To  compute  the  cross  spectrum,  the  correlation  function 


was  evaluated  at  eight  lags  about  the  bit  shift  closest  to  the 
a  priori  estimate  of  the  delay.  The  cross  spectrum,  computed 


for  discrete  values  of  video  frequency  between  0.25  MHz  and 
1.75  MHz,  and  spaced  at  0.25  MHz  intervals,  was  determined  by 
Fourier  transforming  the  correlation  function  evaluated  at  the 
eight  lags.  The  phases  and  amplitudes  of  the  cross  spectral 
components  were  then  used  to  estimate  the  phase  at  video  DC 
(this  phase  should  equal  the  phase  at  RF)  and  the  group  delay 
across  the  2  MHz  bandwidth,  i.e  the  singleband  delay  (see 
discussions  in  this  appendix  concerning  the  maximization  of 
Equations  (B.3.1)  and  (B.3.2)). 

We  will  use  this  simple  model  to  study  some  of  the 
properties  of  the  errors  introduced  by  using  eight  lags.  We 
will  then  study  this  problem  using  a  correlation  function 
which  more  closely  matches  the  actual  properties  of  the 
signals  recorded  on  tape.  The  results  obtained  using  this 
second  approach  will  be  compared  with  the  results  from  actual 
correlation  tests.  All  of  these  tests  will  be  carried  out 
initially  for  USB  data  only.  We  may  then  easily  deduce  the 
effects  of  this  approximation  (using  only  eight  lags)  on  the 
LSB  data  and  dual  sideband  data. 

Equation  (B.4.1)  has  two  important  properties.  Firstly, 
the  cosine  expression  contains  a  very  rapidly  changing  term: 
u)rTg(t).  For  <1^=8  Ghz  and  ip'lxlO  this  rapidly  changing 
term  will  rotate  through  a  full  cycle  in  125  usee,  corres¬ 
ponding  to  correlation  of  500  bits  from  each  tape  (during  a  2 
sec  accumulation  period,  8x10^  bits  will  be  correlated). 
Secondly,  even  if  the  a  priori  values  of  the  group  delay  and 
phase  delay  rate  were  known  exactly,  the  difference  x (t)-T 


could  not  be  kept  constant  because  x  can  only  be  changed  at 

integral  multiples  of  the  sampling  rate. 

In  Figure  B.4.1,  we  show  the  bounds  of  the  phase  errors 

at  video  DC  (see  discussion  of  the  singleband  delay  function 

(Equation  (B.3.3))  for  reasons  for  computing  the  phase  error 

at  video  DC),  for  values  of  x_(t)-xa  .  where  x,^  is  the  a 

g  ap  ap  — 

priori  estimate  of  Xg(t),  ranging  between  -0.8  and  +0.8  usee. 
These  phase  errors  at  video  DC  are  a  direct  measure  of  the 
error  in  the  visibility  phase.  In  the  insert  to  the  figure  we 
show  for  a  very  small  section  the  actual  behavior  of  these 
phase  errors.  Each  cycle  of  the  phase  error  will  be  sampled 
=500  times  for  8.0  GHz  data  and  =2000  times  for  2.0  GHz  data. 
The  samples  over  each  cycle  will  therefore  average  to  the  mean 
value  of  the  bounds.  The  mean  value  curve  is  shown  as  the 
dashed  curve  in  Figure  B.4.1. 

During  correlation, x(t)-x__  will  not  remain  fixed  at  one 

y  ap 

location  on  the  mean  value  curve  because  x  (t)  is  changing 

y 

continuously  while  Xgp,  the  nearest  bit  to  the  a  priori  delay, 
can  only  be  changed  discretely,  i.e.  during  correlation  is 

a  p 

used  not  x,^  (see  Table  B.2.1  and  Section  B.2)  .  The  bit 
ap 

offset  between  the  tapes  is  kept  within  ±0.125  usee  of  the  a 
priori  delay  as  the  data  are  correlated.  The  final  phase 
error  will  thus  be  an  average  of  the  phase  errors  over  the 
inteval  ±  .125  usee  about  the  error  in  the  a  priori  delay. 

The  average  values  of  the  phase  errors  are  shown  in  Figure 
B.4.2. 


It  should  be  noted  that  the  phase  errors  are  anti- 


(deg) 


symmetric  about  0.125  nsec,  not  about  zero.  The  asymmetry  of 
the  error  about  zero  will  lead  to  phase  delay  "closures 
errors"  around  triplets  of  baselines  if  the  a  priori  delays, 
when  referred  to  a  common  epoch,  sum  to  zero  around  the 
triplet.  (We  discuss  such  delay  closure  errors  in  more  detail 
in  Chapter  3.)  The  asymmetry  can  be  removed  by  using  an  odd 
number  of  correlator  lags. 

The  phase  errors  shown  in  Figure  B.4.2  should  be  easily 
detected  by  correlating  a  pair  of  tapes  a  number  of  times 
using  different  values  for  the  a  priori  delay  each  time.  This 
series  of  correlations  was  carried  out  by  A.E.E.  Rogers  at  the 
Haystack  Observatory.  Before  we  present  the  results  of  this 
test  we  should  re-evaluate  our  model  for  the  phase  errors, 
trying  to  match  our  model  to  the  real  Mark  III  system  as 
closely  as  possible. 

The  first  improvement  we  can  make  to  the  model  is  to 
replace  the  perfect  (i.e.,  "flat")  2  MHz  bandpass  filter  with 
a  more  realistic  filter.  The  Mark  III  video  converters  use  a 
seven  pole  Butterworth  filter  (see,  e.g..  Mason  and  Zimmerman, 
1960)  to  filter  the  output  of  video  mixers  before  the  signals 
are  clipped  and  sampled.  This  filter  will  determine  the 
properties  of  the  power  spectrum  of  the  recorded  signals.  The 
transfer  function  for  the  Butterworth  filter  is  given  by 
(Mason  and  Zimmerman,  1960,  p.426) 

P  _1 

hu)  «  n  (iv  -  s.) 
j*l  ■* 

where  S.  ■  exp[-ix (P+2 j-l) / (2 P) )  and  represents  the  poles  of 
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the  filter,  P  is  the  number  of  poles  (seven  in  our  case) , 
v=w/u>k  and  is  the  fullwidth  at  the  50%  (-3db)  power  points 
for  the  filter  output.  For  the  Mark  III  video  converters  tofa  = 
1.8  MHz.  The  seven  pole  Butterworth  filter  power  response  is 
shown  in  Figure  B.4.3.  When  the  Butterworth  filter  is  used 
the  cross  spectrum,  Cb(wv),  (the  superscript  b  denotes  that 
this  is  the  cross  spectrum  which  would  be  obtained  when 
Butterworth  filters  are  used),  of  the  recorded  data  will  be 

Cb(wv)  *  h1(wv)H*(mv)  exp[  i(o)y  ±  wrTg(t)]  (B.4.2) 

,  * 

where  H1(u>v)  and  H2(<*>v)  are  the  transfer  functions  of  the 

Butterworth  filters  at  the  two  sites.  The  above  equation  is 

simply  an  extension  of  Equation  (B.1.5.a)  with  I (w  ) | #  (the 

response  of  the  "flat"  band  pass  filter) ,  replaced  by  the 

response  of  the  Butterworth  filters  at  the  two  sites.  Because 

we  are  currently  assuming  that  t  ,  we  have  absorbed  the 

g  p 

phase  delay  rate  into  a  time  dependent  delay  (the  group  and 
phase  delay  are  assumed  equal  (for  this  current  discussion) , 
hence  the  group  delay  rate  will  equal  the  phase  delay  rate) . 
The  upper  sign  is  for  u>v>0  and  the  lower  sign  is  for  u>v<0. 

We  can  simplify  the  computations  for  this  problem  by 
remembering  that  when  the  correlator  accumulates  the  complex 
cross  correlation  function,  the  rotations  which  are  applied  to 
compensate  for  the  phase  delay  rate  have  the  correct  sign  for 
<*>v>0  only,  for  USB.  (For  LSB  data,  the  correct  sign  for  wv<0 
is  used.)  Consequently,  the  spectral  components  from  USB  data 
with  wv<0  will  not  increase  in  magnitude  proportional  to  the 


Relative  power 


Figure  B.4.3  Seven  pole  Butterworth  filter  power  response 
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number  of  bits  correlated  (N(q);  see  Table  B.2.1),  as  the  data 
are  accumulated.  The  USB  spectral  components  with  u>v<0  will 
be  rotated  in  the  wrong  direction.  We  will  now  investigate 
the  magnitude  of  these  terms  after  the  accumulation  of  the 
complex  cross  correlation  function.  The  accumulation  of  the 
complex  cross  correlation  function  can  be  viewed  as  coherently 
summing  cross  spectral  components  computed  from  4  *isec 
segments  of  data  (see  Appendix  A  for  discussion) .  Many  of  the 
errors  introduced  into  the  estimates  of  the  group  delay,  phase 
delay  rate  and  visibility  phase  by  the  approximations  made  in 
developing  the  estimation  algorithms,  can  be  studied  by  ana¬ 
lyzing  the  effects  of  these  approximations  on  the  phase  of  the 
summed  cross  spectral  components.  We  will  now  develop  a 
simple  formula  which  allows  us  to  estimate  the  magnitude  of 
the  cross  spectral  phase  errors  due  to  the  rotation  of  a  cross 
spectral  component  at  an  incorrect  rate.  The  coherently 
averaged  cross  spectrum  was  given  by  (Equation  (A.15) ) : 


n(q)_* 
-  l  * 
j-1 


X  ei(a 
ljX2je 


»i> 


j 


where  (a  )j  is  the  a  priori  expected  change  in  the  cross 


spectral  phase  from  the  beginning  of  the  accumulation  period. 
We  may  use  the  above  formula  to  study  the  error  in  the 
accumulation  of  the  cross  spectrum  by  replacing  the  summation 


with  an  integration  and  replacing  («_).;  by  the  error  in  the 

ap  j 


rotation  of  the  cross  spectrum.  These  substitutions  yield 


AC  ( to)  =  A  fac  e+lAu)et  dt 


E 


(B.4.3) 


where  AC(u>)  is  the  accumulated  cross  spectrum,  A  is  the 
-  *  - 

amplitude  of  xx jX2 j  (assun,ed  constant)  ,  Tac  is  duration  of  the 
accumulation  period  and  Au>et  is  the  error  in  the  rotation  of 
the  cross  spectral  components  at  time  t  (corresponding  to 
index  2  *n  the  summation)  .  (Awe  will  be  used  as  a  general  term 
to  denote  rotation  rate  errors.  We  will  use  it  in  several 
different  contexts  in  the  remainder  of  this  appendix.)  The 
magnitude  of  AC(w)  will  be 

|  AC(oj)  |  -  2A  sin2(Au>eTac/2)/Au)e  (B.4.4) 

We  should  notice  that  as  Ah>e  approaches  zero,  |AC(<o)| 
approaches  AT  „  as  expected  (i.e.,  if  the  cross  spectral 

dC 

components  are  rotated  at  the  correct  rate,  then  their 
coherent  sum  will  equal  their  algebraic  sum.) 

We  now  return  to  the  problem  of  the  USB  spectral  compo¬ 
nents  with  wv<0.  The  error  in  the  rotation  of  these  cross 
spectral  components,  expressed  as  a  frequency,  will  be  Ao>e  ■ 

2«' (*-_)„  (see  Table  B.2.1  for  definitions).  For  a  small 
®P  4 

phase  delay  rate  of  100  psec/sec  (this  value  will  be  exceeded 
most  of  the  time  for  baselines  >10  km  long),  (u>'/2it)*2  GHz, 
and  Tac*  2  sec  (typical  duration  of  an  accumulation  period) , 
we  have  |  AC(u)  |/|C*(u)  |  *  8xl0"3,  where  C®(w)  is  the 
corresponding  USB  cross  spectral  component  with  «v>0.  This 
ratio  is  a  direct  measure  of  the  expected  phase  error  (in  rad) 
of  the  USB  cross  spectrum  with  wy>0,  due  to  the  USB  cross 
spectral  components  with  u>y<0. 

Although  the  phase  error  given  above  is  very  small  (0?5)  , 
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we  will  continue  to  analyze  this  problem  because  the  analysis 
will  provide  us  with  some  techniques  which  will  be  useful 
later  in  this  appendix.  So  far,  we  have  calculated  the  phase 
error  in  a  single  accumulation  period.  But  when  the  delay 
resolution  function  is  computed,  we  will  sum  together  many 
accumulation  periods  (see  Equation  (B.3.4).  How  do  we  compute 
the  errors  after  this  summation?  There  are  two  techniques  we 
could  adopt.  We  could  compute  the  phase  errors  in  each 
accumulation  period,  at  each  cross  spectral  frequency,  and 
then  estimate,  by  least  squares,  the  errors  introduced  by 
these  phase  errors  into  the  estimates  of  the  multiband  group 
delay,  the  singleband  group  delay,  the  phase  delay  rate  and 
the  visibility  phase.  (See  Appendix  A  and  earlier  discussions 
in  this  appendix.)  This  technique  is  useful  if  we  want  to 
correct  the  measurements  for  a  known  error  (this  technique  was 
used  in  the  initial  study  of  the  errors  introduced  by  using 
only  eight  lags) .  However,  if  we  only  wish  to  bound  the 
effect  of  an  error,  this  technique  may  be  "too  exact",  i.e.  it 
may  involve  more  effort  than  is  warranted  for  determining  a 
bound  on  an  error.  We  can  adopt  a  simpler  technique.  To 
demonstrate  the  application  of  this  technique,  we  show,  in 
Figure  B.4.4,  the  aggregate  cross  spectral  component , ACQbg , 

(at  a  single  frequency) ,  after  several  accumulation  periods 
have  been  summed.  This  figure  shows  that  the  aggregate  cross 
spectrum  will  be 

T 

A_  /  »  .  robs  +  x Aw  t  .. 

ACobs(w)  “  A  l  e  e  dt 


(B.4.5) 


Figure  B.4.4  Accumulation  of  cross  spectral  components  as  accumulation 
periods  are  summed 


second  accumulation 
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where  TQbs  is  the  duration  of  the  observation.  The  magnitude 

2 

of  ACQbs  will  be  |ACobg(aj)|  =  2A  sin  (Au>eTobg/2)  /Ao>e .  We  can 
now  estimate  that  the  magnitude  of  the  USB  spectral  components 
with  wv<0,  relative  to  the  USB  spectral  components  with  u>v>0, 
after  a  100  sec  observation,  will  be  50  times  smaller  than  the 
value  quoted  earlier.  (  |  AC(w)  |/jC®((i>)  |  *1.6x10“*  for  (u>’/2it)=2 
GHz  and  tap=100  psec/sec.) 

To  form  the  complex  cross  correlation  function  we  there¬ 
fore  only  need  to  integrate  C^(uv)  from  zero  to  infinity. 

The  procedures  used  to  evaluate  the  errors  introduced  by 
using  only  eight  lags  in  conjunction  with  the  seven  pole 
Butterworth  filter  were  the  same  as  discussed  for  the  perfect 
bandpass  except  that  the  complex  cross  correlation  was 
directly  evaluated  by  numerically  integrating,  rather  than 
analytically  integrating,  the  cross  spectrum.  When  the 
complex  cross  correlation  function  is  directly  evaluated  it  is 
no  longer  necessary  to  average  over  a  cycle  of  u>rTg(t).  To 
see  why  this  is  so  we  need  to  return  to  the  real  (in  the 
complex  sense)  cross  correlation  function  (Equations  (B.4.1)). 
The  only  difference  between  the  calculations  of  the  real  and 
the  complex  cross  correlation  functions  is  the  limits  of  the 
integration  used  in  Fourier  transforming  the  cross  spectrum. 
The  real  cross  correlation  function  is  obtained  by  integrating 
the  cross  spectrum  from  -®  to  ®,  and  represents  the  expecta¬ 
tion  of  the  cross  correlation  function  of  a  short  segment  of 
data  (4  ^sec  duration)  from  the  two  sites.  The  complex  cross 
correlation  function  is  obtained  by  integrating  the  cross 


spectrum  from  0  to  •  and  represents  the  cross  correlation 
function  obtained  from  the  coherently  averaged  cross  spectrum 


The  phase  term,  w^x^t),  appears  in  the  cosine  term  in 
Equation  (B.4.1)  and  changes  the  shape  of  the  correlation 
function  as  x^(t)  varies  (see  Figure  B.l.l.a).  These  changes 
in  the  shape  of  the  cross  correlation  function  will  affect  the 
errors  introduced  by  using  only  8  lags  (see  insert  in  Figure 
E.4.1  and  accompanying  explanation).  The  changes  in  wrtg(t) 
will  also  affect  the  complex  cross  correlation  function,  but 
in  this  case  the  shape  of  the  correlation  function  will  be 
unchanged  as  x^(t)  changes.  This  constancy  of  sphape  may  be 
easily  seen  by  writing  the  expression  for  the  complex  cross 
correlation  function.  R^(A-c^)  will  be  given  by 


)  -  e^rV1'  /  H1<»v)H;(Uv)ei%VtVi"v<i'VV 

%  (B.4.6) 


where  we  have  included  x  because  the  lag  is  taken  about  this 

ap 

value,  i.e.  x  =  xap+Ax^  (see  Table  B.2.1  and  Section  B.2) . 

we  should  notice  that  u)rTg(fc)  appears  outside  of  the 
integration  and  hence  will  only  rotate  (in  the  complex  plane) , 
the  complex  cross  correlation  function,  and  not  change  the 
shape  of  this  function.  (If  we  replace  the  Butterworth  filter 
transfer  functions  in  the  above  equation,  with  the  transfer 
function  of  a  perfect  bandpass  filter  (i.e.,  H^(u>v)=l  for 
-B<u>v<B  and  zero  elsewhere)  we  see  that  R  u(Axj)  reduces  to 
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where  sinc(a) =sin(a) /a.  We  see  from  the  above  equation  that 

c  , 

Ru(Ax^)  *s  a  s*nc  function  rotated  in  the  complex  plane  by 

a>rTg(t)+*B<Tg(t)-AT.-Tap)  . 

The  cross  spectral  components  which  are  generated  from 
the  complex  cross  correlation  function,  evaluated  at  eight 
lags,  will  only  rotate  as  xg(t)  is  varied  by  small  amounts,  of 
order  2it/wr,  i.e.  each  cross  spectral  component  will  be 
multiplied  by  e-lwrTg^t^  independently  of  its  video  frequency 
because  e-lu)rTg(t)  can  be  taken  outside  of  the  calculation. 

The  amplitude  and  phase  errors  in  the  cross  spectral 
components  will  remain  the  same  over  these  small  variations  in 
xg  because  the  shape  of  the  cross  correlation  function  will 
not  change.  (The  shape  of  the  correlation  function  will 
change  with  variations  in  xg  of  the  order  of  1/B) . 

The  results  obtained  using  the  complex  cross  correlation 
function  should  be  exactly  the  same  as  those  obtained  from  the 
real  cross  correlation  function  after  we  have  averaged  over  a 
cycle  of  wrTg ( t) . 

In  Figure  B.4.5,  the  estimated  errors  in  the  relative 
correlation  amplitude,  the  singleband  delay  and  the  visibility 
phase  are  shown  for  errors  in  the  a  priori  delay  that  vary 
from  -0.6  nsec  to  0.8  nsec.  In  addition  we  also  show  the 
results  obtained  from  the  repeated  correlations  of  a 
Westf ord-For t  Davis,  USB,  S-band  observation  of  the  radio 
source  3C  273B  on  July  7,  1982  at  02:34  UT.  The  experimental 
and  theoretical  results  match  reasonably  well  although  there 
are  some  differences.  (These  differences  will  be  discussed 


Relative  correlation 
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after  we  have  studied  the  multiband  group  delay  results.) 

For  the  geodetic  applications  of  VLBI  we  are  interested 
in  the  multiband  group  delay  errors.  What  magnitude  errors  do 
we  expect  because  of  the  use  of  8  lags?  We  should  expect  zero 
errors  because  the  phase  errors  in  each  channel  are  indepen¬ 
dent  of  the  radio  frequency.  (The  only  term  involving  the 
radio  frequency  is  wrTg(t).  As  we  have  just  seen  this  term 
can  be  taken  outside  of  the  calculations  (with  the  exception 
of  small  fringe  rate  observations  (see  discussion  associated 
with  Equation  (B.4.3)),  and  hence  does  not  affect  the  errors 
introduced  by  using  only  eight  lags.)  In  Figure  B.4.6,  we 
present  the  variations  in  the  estimates  of  the  multiband  group 
delay  results  found  from  the  repeated  correlations.  Clearly 
the  observed  variations  are  not  zero.  Over  the  normal  range 
of  ±0.1  nsec  error  in  the  a  priori  delay ,  the  multiband  delay 
changed  by  50  psec. 

To  investigate  these  observed  multiband  group  delay 
variations,  we  need  to  firstly  look  at  the  "performance"  of 
the  correlator,  i.e.  had  we  repeated  these  correlations,  not. 
changing  the  a  priori  delay,  what  variations  would  we  have 
seen?  The  answer  unfortunately  is  not  zero,  even  though  the 
correlator  is  a  digital  system  and  hence  should  be  able  to 
exactly  reproduce  results.  The  reproducibility  problems  seem 
not  to  be  with  the  correlator  (in  general)  but  with  reading 
the  magnetic  tapes.  (In  some  cases,  hardware  problems  do 
develop  in  the  correlator  modules  causing  nonreproducibility 
with  a  particular  module  with  a  hardware  error.  These  cases 


Figure  B.4.6  Variations  in  the  estimate  of  the  multiband  group  delay  as 
a  function  of  the  a  priori  delay  error-  S  band 
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are  rare  and  will  not  be  discussed  here.)  To  indicate  the 
magnitude  of  nonreproducibility  (i.e.,  the  differences  between 
measurements  when  tapes  are  recorrelated  using  the  same  a 
priori  parameters),  we  show  in  Figure  (B.4.7),  a  histogram  of 
differences  between  the  multiband  group  delays  obtained  from 
repeated  correlations  of  21  observations  of  the  radio  source 
pair  1038+52A  and  B  (Marcaide,  1982) .  We  have  plotted  the 
histogram  with  bins  defined  by  the  difference  in  estimates  in 
the  multiband  group  delay  divided  by  their  standard  deviation. 
(This  form  was  chosen  because  earlier  reproducibility  results 
had  indicated  that  the  nonreproducibility  was  a  function  of 
the  standard  deviations  of  the  estimates.) 

If  we  now  reexamine  Figure  B.4.6  and  compare  the  results 
with  the  possible  variations  in  the  multiband  group  delay  with 
repeated  correlations  (Figure  B.4.7),  we  see  that  the  varia¬ 
tions  in  Figure  B.4.6  can  be  explained  by  correlator  non¬ 
reproducibility.  To  further  study  this  problem,  the  X  band 
results  from  these  repeated  correlations  (at  different  lags) 
were  analyzed.  These  results  are  shown  in  Figure  B.4.8. 

Again  we  see  variations  in  the  multiband  group  delays  which 
are  consistent  with  the  nonreproducibility  of  the  results. 

Also  the  X-band  variations  show  no  correlation  with  the  S-band 
variations . 

To  confirm  that  the  effects  of  errors  in  the  a  priori 
delay  are  zero  (or  very  small),  we  could  repeat  these  tests 
and  average  the  results  of  the  repeated  correlations.  To 
reduce  the  pseudo-noise  due  to  correlator  nonreproducibility. 


'  U*L  ■Vi'L'"L  L’’*  I’ll  '.^  t”S.  ■-'■%  l"VJ  ■„*  ^  1  ^.— 
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to  10  psec,  would  cequice  at  least  25  repeated  correlations  at 
each  of  the  14  different  values  of  the  a  priori  delay.  These 
350  repeated  correlations  would  be  a  large  committment  of 
(limited)  resources. 

There  may  be  a  better  approach.  In  May,  1983  a  geodetic 
experiment  was  carried  out  using  the  100  m  diameter  antenna  at 
Effelsburg  in  the  Federal  Republic  of  Germany.  The  multiband 
group  delay  estimates  from  baselines  involving  this  antenna 
can  have  standard  deviations  of  a  few  picoseconds.  We  should 
use  these  data  to  analyze  the  effects  of  errors  in  the  a 
priori  delay.  This  analysis  will  not  be  possible  before  the 
completion  of  this  thesis. 

The  correlator  nonreproducibility  would  also  explain  the 
differences  between  the  observed  variations  and  the  predicted 
differences  seen  in  Figure  B.4.6.  Again,  the  repeated 
correlations  of  baselines  involving  the  Effelsburg  antenna 
should  confirm  if  this  is  the  actual  reason  for  these  differ¬ 
ences  . 

All  of  the  results  presented  so  far  have  assumed  that 
only  USB  data  were  available.  We  will  now  consider  the 
effects  of  using  eight  lags  when  only  LSB  data  are  available. 
For  the  simplified  case  we  are  currently  studying  (see 
earlier  discussion) ,  the  lower  sideband  cross  spectrum, 

C][(wv)  ,  is  given  by  (see  Equation  (B.1.5.b)) 

C5<V  *  Hi  (VH2  exp[  i  (u>v+u>r)  (t) )  (B.4.8) 


where  the  upper  signs  refer  to  u>v>0  and  the  lower  signs  to 


uv<0.  We  have  written  the  cross  spectrum  in  the  same  form  as 
in  Equation  (B.4.2).  Only  the  negative  video  frequencies  from 
the  LSB  are  used  in  the  computation  of  the  cross  spectrum  (see 
discussion  concerning  Equations  (B.3.1)  and  (B.3.2)),  and  in 
the  discussion  on  the  complex  cross  correlation  function 
associated  with  Equation  (B.4.2).  A  review  of  the  discussion 
associated  with  Figure  2.1.3.  may  also  be  useful  at  this  time. 
For  wv<0  the  cross  spectrum  becomes 

CX(<V  *  Hi(a,v)H2(<V  exp[-i|wv|  +  iwrTg(t)] 

where  we  have  explicitly  shown  the  sign  associated  with  wv  by 
using  its  absolute  value  in  the  exponent.  We  would  form  the 
complex  cross  correlation  function  from  this  cross  spectrum  by 

R^(M  )  =  e1UrTg(t)  J  Hj.  (-  |wj  )H*  (-  |u>v| )  e_1l“v,Tg 

ei  Iwv|  (ATi  +  Tap)  d  |  u>v  |  .  (B.4.9) 

If  we  compare  this  equation  with  the  equation  for  the  upper 
sideband  complex  cross  correlation  function  (Equation  B.4.6)), 
we  see  that  they  are  very  similar  (as  we  should  expect)  except 
for  some  sign  changes.  To  investigate  the  effects  of  these 
sign  changes,  we  will  analytically  evaluate  Ra(At.) .  To  do 
this  we  will  replace  the  Butterworth  filter  responses  with  a 
flat  bandpass  filter  (this  replacement  will  give  us  an 
equation  which  we  can  directly  compare  with  (Equation  B.4.7)). 


(B.4.10) 


With  the  flat  bandpass  filter,  R^(Ax^)  becomes, 
R=(4*.)  .  ei“tTg<t)-1'B(xg,t,-4Ti-''ap) 


sinc[«B('t9(t)-ati-tap)) 

In  Figure  B.4.9,  we  show  a  graphical  comparison  of  R^fAx^)  and 
R^(ATi),  (Equation  B.4.7).  We  should  notice  that  the  magni¬ 
tudes  of  R^(Ax^)  and  R^tAx^)  as  functions  of  Ax^  are  exactly 
the  same;  the  arguments,  however,  have  opposite  sign  slopes 
(as  functions  of  Ax^),  intersecting  Axj=0  at 

-wr*g(t) +*B(xg(t)-xap)  and  -urxg(t)-xB(xg(t)-xap) ,  respective¬ 
ly.  (These  intercepts  may  appear  to  have  incorrect  signs. 

This  change  of  sign  occurs  because  the  Fourier  transform  of 
these  correlation  functions  will  generate  the  complex  conju¬ 
gates  of  the  cross  spectra  (see  discussion  at  the  beginning  of 
this  appendix) .) 

When  the  estimates  of  the  cross  spectrum  are  calculated 
from  the  complex  cross  correlation  functions,  the  contribution 
of  the  cross  correlation  function  at  each  lag,  (rotated  by 
e^u>v&Ti)  ,  to  the  summation  will  be  a  constant  rotation  (inde¬ 
pendent  of  the  video  frequency  being  computed,  i.e. 
-wrTg(t)-iiB(xg(t)-xap)  ,  for  USB,  and  -u>rxg  ( t)  +wB(xg  ( t) -xap)  , 
for  LSB) ,  plus  a  rotation  which  will  have  opposite  signs  for 
the  upper  and  the  lower  sidebands  ( f u>v-xB)  Ax ^  and 
—  1 1  u>v  |  —  xxB  ]  Ax  ^  ,  respectively).  (Remember,  only  u)y>0  should  be 
used  with  R^(Ax^)  and  u>v<0  with  R^(Ax^)  .)  Consequently,  the 
phase  errors  of  the  cross  spectral  components  (i.e.,  the  dif¬ 
ference  between  cross  spectrum  phase  at  u>v  and  correct  phase 
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at  u)v  (given  by  wrTg  ( fc)  +“v  (*g  ( fc)  “''ap)  >  '  ^r0in  upper  and  lower 
sidebands  will  have  opposite  signs  (see  Figure  B.4.10). 

Figure  B.4.10  easily  allows  us  to  deduce,  from  the  USB 
results,  the  effects  of  using  only  eight  lags,  when  only  LSB 
data  are  available.  The  singleband  delay  errors  will  be  the 
same  for  each  sideband.  The  phase  error  at  video  DC,  which  is 
the  visibility  phase  error,  will  have  opposite  sign  for  upper 
and  lower  sideband  data.  When  uSB  and  LSB  data  are  combined 
(in  equal  proportions),  the  visibility  phase  errors  will  be 
zero  (this  may  be  seen  from  Figure  B.4.10;  the  weighted  least 
squares  fit  of  a  slope  and  intercept  through  the  cross 
spectral  phases  from  both  side  bands  will  intercept  <*>v=0  at 
the  correct  visibilty  phase) .  (Normally,  the  phase  of  the 
cross  spectrum  of  heterodyned  signals,  at  the  u>v=0  boundary, 
is  not  continuous  (i.e.,  the  phase  changes  sign  for  real 
signals) ;  this  discontinuity  is  not  the  case  in  Figure  B.4.10 
because  this  figure  does  not  represent  the  pure  cross  spectrum 
of  a  heterodyned  signal  (see  discussion  about  Equation 
(B.1.1)).) 

It  should  be  noted  that  if  the  a  priori  delay  is  in  error 
by  more  than  0.8  nsec,  then  the  correlation  amplitude  will 
drop  rapidly  and  the  wrong  peak  in  the  delay  resolution 
function  could  be  selected  when  estimating  the  multiband  group 
delay.  If  the  wrong  peak  is  selected  then  the  visibility 
phase  and  the  estimates  of  the  quantities  will  be  useless. 

At  the  beginning  of  this  discussion  we  assumed  that  t 

g  p 

and  that  A4>u  ( t)  =0.  If  these  two  assumptions  are  removed,  will 


our  conclusions  be  changed?  No.  The  only  effect  will  be  to 
introduce  an  additional  phase  term  to  the  exponent  outside  of 
the  integration  used  to  compute  the  complex  cross  correlation 
functions  (see  Equations  (B.4.6)  and  (B.4.9)).  Since  these 
additional  terms  are  independent  of  At^  and  wv ,  there  presence 
will  not  affect  the  calculation  of  the  errors. 

Before  finishing  with  the  effects  of  using  only  eight 
lags  we  should  stress  that  the  estimated  errors  will  only  be 
valid  if  the  phase  delay  rate  is  sufficiently  large  that 
(j^ipAt  will  rotate  through  several  cycles  during  the  observa¬ 
tion  (typically  >100  seconds).  For  S-band  data,  the  phase 
delay  rate  would  need  to  be  greater  than  100  psec/sec  to 
ensure  25  cycles  of  rotation  of  u>r*pAt,  in  which  case  the 
visibility  phase  errors  in  Figure  B.4.6  could  be  incorrect  by 
2.6  deg  (see  discussion  on  the  complex  cross  correlation 
function  (Equation  (B.4.5)). 

The  next  error  we  will  discuss  is  the  use  of  the  3  level 
sine/cosine  function  to  carry  out  rotations  (this  function  is 
shown  in  Figure  B.4.11).  This  problem  can  be  viewed  very 
elegantly  from  a  Fourier  series  approach.  If  we  refer  back  to 


Equation  (B.2.7)  we  see  that  the  complex  cross  correlation 
function  R^(Ax^)  was  given  by,  (see  Table  B.2  for  symbols) 

c  N(<5>  1  1  b 

V4V  -  *  *l<VP“s)X2(V,,ap’p+P4V1,i) 


expl-iw* (*ap)q(tp-tq) ) 


If  we  group  terms  in  this  expression  by  setting 

r(PAts,Ax.)  =  x1(tq+PAts)x2(t^(T^p)p+PAts-ATi) 

and  fap  *  (tap)q  '  then 
c  N(^} 

Ru(ATi)  =  r(PAts'ATi)expf“ifappAts1  (B.4.11) 

Equation  (B.4.11)  should  be  recognized  as  the  discrete  Fourier 

series  transform  of  r(pAt  ,Ax.)  at  frequency  f  .  To  deter- 

s  i  dp 

mine  the  errors  introduced  by  using  the  3  level  sine/cosine 
function  we  can  use  the  convolution  theorom  of  Fourier  trans¬ 
forms  (Bracewell ,  1978,  pp.  108-112).  When  the  3  level 
sine/cosine  is  used  to  evaluate  Equation  (B.4.11) ,  the  results 
obtained  will  be  the  product  of  the  Fourier  coefficient  we 
want  and  the  Fourier  transform  of  the  3  level  sine/cosine 
function.  The  3  level  sine/cosine  function  is  periodic  and 
hence  can  be  expanded  in  Fourier  series.  The  first  five 
coefficients  of  the  Fourier  series  for  the  3  level  sine/cosine 
functions  are 
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The  first  effect  of  using  the  3  level  sine/cosine  function 
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will  be  to  multiply  the  correlation  amplitude  by  (4/w) cos(w/8) 
(i.e.,  an  increase  of  -18%).  This  increase  in  amplitude  is 
compensated  after  the  summation  (B.4.11)  is  completed  by 
dividing  the  magnitude  of  the  singleband  delay  function 
(Equation  (B.3.3))  by  (4/it)  cos(it/8 )  . 

The  increase  in  correlation  amplitude  is  the  major  bias 
introduced  by  the  3  level  sine/cosine  approximation  provided 
f  is  sufficiently  large  that  the  f  pAt  rotates  through  a 

dp  dp  S 

large  number  of  cycles  during  an  observation.  (See  discussion 
below.) 

For  small  fringe  rates,  the  Fourier  series  coefficents  of 
the  3  level  sine  function  will  be  different  from  the  coef- 
ficent  of  the  3  level  cosine  function  if  these  functions  have 
not  rotated  through  an  integral  number  of  cycles.  The  differ¬ 
ence  in  the  coefficents  will  introduce  errors  in  both  the 
visibility  phase  and  the  correlation  amplitude. 

We  can  investigate  the  magnitude  of  these  phase  and 
amplitude  errors  by  calculating  the  difference  between  the 
Fourier  series  coefficients  of  the  3  level  sine  and  cosine 
functions.  After  the  3  level  sin/cosine  has  cycled  through 
Nsc  cycles,  i.e.  integer  value  of  fapT0bs/(2it ^  '  where  Toljs  is 
the  duration  of  the  observation,  the  maximum  difference 
between  the  sine  and  cosine  Fourier  series  coefficients  will 
occur  when  fapT0bS  ec3uals  (Ngc+l/8)  ,  (see  Figure  B.4.12). 

For  this  value  of  the  argument  of  the  3  level  sine/cos 
function,  the  Fourier  series  coefficients  will  be 
[ (4Nsc+l)cos(n/8)  -  cos(x/4)]/l*(Nsc+l/8)]  and 


I (4N  +l)cos(x/8)  +  sin(x/4)]/(n(Ne  +1/8)] ,  for  the  3  level 

sine  and  3  level  cosine  functions,  respectively.  The  maximum 

difference  between  the  coefficients  will  be 

[cos(x/8)-2sin{n/4)]/[x (Nsc+l/8 ) ]  «  -0 . 16/ (Ngc+l/8 ) .  This 

difference  could  lead  to  cross  spectral  phase  errors  of  up  to 

8°  (0.14  rads),  and  correlation  amplitude  errors  of  14%.  But 

we  also  notice  that  the  phase  errors  decay  rapidly  as  fapTobs 

increases.  Only  on  short  baselines  (<10  km  length)  will 

f  T  be  small  for  extended  periods  of  time  (more  than  one 

ap  obs  r 

observation).  For  a  phase  delay  rate  of  100  psec/sec,  wr=2.2 
GHz,  and  Tobs®l°0  sec,  the  maximum  cross  spectral  phase  error 
due  to  the  3  level  sin/cos  approximation  will  be  0?4. 

The  effects  of  the  3  level  sine/cosine  approximation  on 
the  multiband  group  delays  may  be  calculated  from  the  above 
results.  The  different  fringe  rates  (i.e.,  fap) ,  for  each 
channel  in  the  synthesized  band  will  lead  to  different  phase 
errors  in  each  channel,  and  hence  introduce  multiband  group 
delay  errors.  For  the  case  given  above  and  a  synthesized 
bandwidth  of  80  MHz  (typical  for  S-band) ,  the  multiband  group 
delay  error  could  reach  0.030  nsec  (again,  only  for  short 
baselines  will  errors  of  this  magnitude  occur) . 

The  third  approximation  we  will  study  is  the  neglect  of 
the  video  rotation  term.  While  deriving  the  complex  cross 
correlation  function  we  neglected  the  term 
expl-iu>v jtp (tp_tq)  3  ,  (see  Equation  (B.2.3)).  This  rotation 
represents  changes  in  the  phases  of  cross  spectral  components 
across  the  video  frequency  range  due  to  the  total  phase  delay 
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rate.  Since  w'  (see  Table  B.2.1),  is  chosen  at  the  center  of 
the  video  band,  the  phase  errors  due  to  the  neglect  of  the 
video  rotation  will  be  zero  for  the  frequency  at  the  center  of 
the  video  band.  For  the  other  frequencies  in  the  cross 
spectrum  there  will,  however,  be  errors  introduced  by  neglect¬ 
ing  the  video  rotation. 

The  effects  of  the  neglect  of  the  video  rotation  term 
were  studied  by  Whitney  (1974)  for  the  Mark  I  VLBI  system.  A 
sufficiently  large  number  of  the  characteristics  of  the  Mark  I 
system  are  different  in  the  Mark  III  system  for  the  effects  of 
this  video  rotation  term  to  be  re-evaluated.  The  maximum 
error  due  to  this  term  occurs  in  the  video  frequency 
components  near  the  edges  of  the  bandwidth.  The  bandwidth  of 
the  recording  system  increased  from  360  KHz  to  2  MHz  as  we 
changed  from  Mark  I  to  Mark  III.  This  increase  in  bandwidth 
will  increase  the  error  in  the  phase  of  the  cross  spectral 
components  near  the  edges  of  the  bandwidth  by  a  factor  of  5.6, 
i.e.  by  the  radio  of  bandwidths.  The  length  of  an 
accumulation  period  has  also  been  increased  by  a  factor  of  10, 
i.e.  the  accumulation  period  has  changed  from  0.2  sec  to  2 
sec.  The  combination  of  these  two  changes  has  increased  the 
error  due  to  the  neglect  of  video  rotation  by  a  factor  of  56 
over  the  values  given  by  Whitney.  If  we  use  a  maximum  value 
for  the  phase  delay  rate  of  2x10“®  sec/sec  and  an  accumulation 
period  of  2  sec,  the  error  accumulated  at  the  edge  of  the 
video  band  (1  MHz  from  the  center  frequency)  would  be  4 
cycles.  This  error  is  unacceptably  large.  Clearly  the  effect 
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of  the  error  due  to  video  rotation  must  be  reduced. 

To  reduce  this  error  we  must  first  realize  that  the 
rotations  applied  to  the  data  streams  during  correlation  do 
not  exactly  compensate  for  a  change  in  the  arrival  times  of 
the  signals  at  each  site.  Only  the  center  frequency  of  the  2 
MHz  bandwidth  has  the  correct  rotation  applied.  The  other 
frequencies  in  the  cross  spectrum  are  rotated  by  an  angle 
which  is  error  by  (wv-tiB)  Tap{tj-tg)  .  One  method  we  could  use 
to  solve  this  problem  would  be  to  accumulate  a  cross  correla¬ 
tion  function  for  each  of  the  video  cross  spectral  frequencies 
whose  components  are  be  estimated.  ltiis  approach  however 
would  require  a  sevenfold  increase  in  the  storage  requirements 
in  the  correlator  modules  and  in  FRNGE.  An  alternative 
approach  is  to  always  keep  the  bit  offset  between  the  tapes 
within  ±0.5  of  the  bit  spacing  (0.25  nsec)  .  Each  time  a  bit 
shift  is  applied  there  should  be  a  90°  phase  shift  applied  to 
compensate  for  the  change  in  lag  index.  (See  Equation 
(B.4.7);  a  change  in  xgp  by  0.25  ^sec  will  rotate  R^(At.)  by 
*BAt  ,  which  equals  90°  (see  Table  (B.2.1)).  The  rotation  is 

5 

applied  to  compensate  for  this  change.  When  we  developed  the 

correlator  algorithms  (Equation  (B.2.7))  we  included  a  term 

for  changing  the  bit  shift  during  correlation,  ((t^  ).).)  The 

ap  j 

use  of  bit  shifts  does  not  eliminate  the  video  rotation  error. 

The  pattern  of  rotation  errors  of  the  cross  spectral 
component  at  video  frequency,  wv,  is  shown  in  Figure  B.4.13. 

Each  discontinuity  corresponds  to  a  bit  shift  occurring, 

,  o 

accompanied  by  the  90  phase  rotation.  The  accumulated  error 


Figure  B.4.13  Notation  and  geometry  of  the  viseo  rotation  error 


of  the  cross  spectral  component  at  will  be  the  average  of 
"saw  tooth"  phase  errors  shown  in  Figure  B.4.13.  This  accumu¬ 
lated  phase  error,  A4» ac r  will  be  given  by 

(w  -wB)  .  -(<«>  -uB)  . 

Mac  =  ft— - +A4.]At£/2  +  [ - ^ -  +A#f]AtJ/2}  /T 

ac  4B  id  4B  r  o  ac 

(B.4.12) 

where  A<t>^  and  A$^  are  the  rotation  errors  at  the  beginning  and 

the  ends  of  the  accumulation  period,  respectively,  (see  Figure 

B.4.13) ,  A t^  is  the  interval  of  time  between  the  beginning  of 

the  accumulation  period  and  the  first  bit  shift,  and  Atb  is 

the  interval  of  time  between  the  last  bit  shift  and  the  end  of 

the  accumulation  period,  B  is  the  bandwidth  and  T  is  the 

dc 

duration  of  an  accumulation  period  (see  Table  B.2.1) . 

Equation  (B.4.12)  can  be  reduced  to  a  simpler  form  by  noting 
that  from  Atb  and  At^,  A$^  and  A$^  can  be  calulcated.  From 
geometry,  in  Figure  B.4.13,  we  have 

A*x  *  (wv-nB)/(4B)  -  (uv-nB)  AtJ/(2BAtbg) 

and 

A*f  -  -(tov-xB)/(4B)  +  («v-hB)  Atb/(2BAtbg) 


where  Atbs  is  the  interval  of  time  between  bit  shifts  (i.e., 
Atbs  =  l/(2fapB) ) .  After  some  simple  algebra.  Equation 
(B.4.12)  reduces  to 


to  -ttB 
V 


(AtJ-AtJ)[l-(AtJ+AtJ)/Atb8]/Tac  (B.4.13) 


The  above  equation  may  be  simplified  further  by  expressing 
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At^  and  Atb  as  functions  of  the  fractional  bit  shift  at  the 

center  of  the  accumulation,  FB,  (  FB=  t®  /At  -It®  /At  +.51) 

flp  s  «p  s 

where  T^p  is  the  a  priori  delay  at  the  center  of  the  accumula¬ 
tion  period,  Atg  is  the  sampling  interval  and  *  »  denotes  the 
largest  integer  smaller  than  the  quantity  inside  the  symbols) , 
and  the  "floating  point"  number  of  bit  shifts  in  the 
accumulation  period,  S,  (S  *  (*ap) qTac/Atg) .  (The  actual 
number  of  bit  shifts  in  an  accumulation  period  will  be  IS*). 
From  Figure  B.4.13  we  can  deduce  that 

AtJ  =  (S/2  -  Nbl  -  FB  +  0.5)  Atbg  (B.4.14.a) 

and 

Atb  *  (S/2  -  Nb2  +  FB  +  0.5)  Atbg  (B.4.14.b) 

where  Nbl  and  Nb2  are  the  numbers  of  bit  shifts  before  and 
after  the  center  of  the  accumulation  period;  Nbl  +  Nb2  “  ,SI* 
By  adding  and  subtracting  the  two  Equations  (B.4.14),  we 
obtain 

-  Atb  -  <Nb2  -  Nbl  '  2FB»  4tbs 

and 

“b  -  4tb  *  (S  -  "bl  '  Nb2  +  1(  4tbs- 
Substitution  of  these  results  into  Equation  (B.4.13)  yields 

4*,c  ■  “  [ PB- (Nb2-Nbl ) ] [ s- <"bl+Nb2  > 1  4tbs/Tac 
ZB 

(B.4.15) 

Finally,  we  can  eliminate  Nb^  and  Nb2  from  the  above  equation 


by  noting  that 


^Nb2_Nbl^2  =  0  ,  if  IS H  is  even 

=  1/2  ,  if  ISI  is  odd  and  FB  <  0 
=  -1/2,  if  I S X  is  odd  and  FB  >  0. 

We  will  denote  (Nb2-Nbl^2  by  “b'  wh*ch  yields  for  the  error 
due  to  the  neglect  of  video  rotation 

(a)  -HB 

A»ac  =  [ra-«b][s-ISI)]  4tbs/T„c  .  (B.4.16) 

2B 

In  Figure  B.4.14,  we  show  the  error  in  the  phase  of  the 
video  DC  cross  spectral  components  which  result  from  neglect¬ 
ing  the  video  rotation  as  a  function  of  the  time  measured  in 
bit  shifts,  (one  unit  of  bit  shift  time  is  Atbg) .  To  generate 
this  figure  we  assumed  that  Atb=0,  (see  earlier  discussion) . 
(The  video  DC  frequency  component  is  not  computed  from  the 
cross  correlation  function  but  the  error  at  this  frequency 
will  represent  the  visibility  phase  error  after  the  singleband 
delay  has  been  estimated,  see  discussion  on  Equation  (B.3.3).) 

As  with  earlier  errors,  we  see  that  this  error  will  only 

have  a  significant  effect  on  observations  with  small  phase 

delay  rates.  For  a  phase  delay  of  100  psec/sec,  the 

visibility  phase  error,  (i.e.  the  phase  error,  u>v*0)  ,  could 

o 

be,  after  a  100  sec  observation,  as  high  as  22.5,  (S»0.04  and 
hence  it  is  unlikely  that  there  will  be  any  bit  shifts  in  the 
observation) .  (We  can  extend  the  results  from  an  accumulation 
period  to  the  observation  duration  using  the  same  argixnents 
which  were  presented  with  Equations  (B. 4. 3) - (B. 4. 5) ) .  This  is 
a  very  large  error  and  steps  have  now  been  taken  to  eliminate 


Phase  error  at  DC  due  to  the  neglect  of  video  rotation 


Time  from  beginning  of  accumulation  period 
(measured  in  bit  shifts) 
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this  error  (see  discussion  later). 

Dual  sideband  data  will  not  be  affected  by  this  error, 
when  the  sidebands  are  combined  in  equal  proportions,  because 
the  LSB  error  will  have  opposite  sign  to  the  USB  error.  (This 
sign  change  may  be  shown  using  arguments  similar  to  those  used 
when  we  were  studying  the  effects  of  using  eight  lags;  see 
Equations  (B.4.8)-(B.4.10) .)  If  the  proportions  of  upper  and 
lower  sidebands  are  not  equal,  the  error  may  be  calculated 
from  the  weighted  mean  of  the  upper  and  lower  sideband  errors 
(see  discussion  associated  with  Equation  (B.3.5)). 

The  multiband  group  delays  will  not  be  affected  by  this 
error,  unless  there  are  data  losses  in  the  observation, 
because  the  error  does  not  depend  on  the  RF  frequency  (see 
Equation  (B.4.16)).  A  10%  data  loss  in  one  channel  may  intro¬ 
duce  differences  in  the  phase  errors  between  channels  of  up  to 
o 

2  ,  (assuming  the  data  loss  occurs  only  immediately  after 
every  bit  shifts,  which  is  unlikely) ,  and  hence  may  introduce 
errors  in  the  multiband  group  delays.  (This  topic  is  explored 
in  further  detail  in  Chapter  3.) 

The  video  rotation  term  will  introduce  a  bias  into  the 
correlation  amplitude.  If  the  power  spectral  density  of  the 
recorded  signals  is  constant  over  the  2  MHz  bandwidth,  the 
amplitude  of  the  cross  spectral  components  at  frequencies 
different  from  that  of  the  center  of  the  video  band  will  be 
reduced  because  of  the  incorrect  phase  rotation.  The 
amplitude  reduction  of  a  cross  spectrum  component  with 
frequency  w  will  be  (if  several  bit  shifts  occur  during  the 


observation  so  that  we  can  ignore  the  contributions  from  At£ 
and  At^  (see  Figure  B.4.13  and  the  discussion  associated  with 
Figure  B. 4.4)  , 

®  s  —  i  ft 

|  /  e  d0  |/29s  =  2  sin(6g)/[  (a)v-itB)  Atg]  (B.4.17) 

—  0 

S 

where  6g  =  (wv-xB) Atg/2 .  For  the  1.75  MHz  and  the  0.25  MHz 
video  frequency  components  the  reduction  in  amplitude  will  be 
5.7%.  The  reduction  in  amplitude  when  the  seven  video  fre¬ 
quencies  are  averaged  will  be  2.5%.  This  bias  may  be  cor¬ 
rected  by  multiplying  the  observed  correlation  amplitude  by 

*1.026  .  This  type  of  correction  is  made  in  FRNGE  but  the 

2 

factor  used  is  1/(1-*  /288]  *1.035.  The  correction  applied  in 
FRNGE  is  the  amplitude  correction  which  would  be  applicable  if 
very  closely  spaced  (<<0.25  MHz)  frequencies  in  the  video 
cross  spectrum  were  evaluated  (Whitney,  1974,  pp.  164-166, 
i.e.,  Whitney  integrated  the  bias  over  the  video  bandwidth 
rather  than  averaging  the  biases  for  each  of  the  frequencies 
used)  . 

After  10  August, 1982,  the  error  in  total  phase  due  to  the 
neglect  of  the  video  rotation  was  eliminated  by  applying  a 
(video)  frequency  dependent  rotation  correction,  (Equation 
(B.4.16)),  to  the  video  cross  spectral  components  computed  in 
FRNGE. 

We  will  now  discuss  the  effects  of  the  last  approximation 
listed  at  the  beginning  of  this  discussion:  possible  biases 
caused  by  an  error  in  the  a  priori  phase  delay  rate.  To 
analyze  this  problem  it  is  convenient  to  visualize  the  accumu- 


lation  of  the  cross  correlation  function  in  terms  of  coher¬ 
ently  averaging  the  cross  spectral  components  (this  is  the 
method  which  we  originally  discussed  in  Appendix  A) .  The 
coherent  average  was  formed  by  summing  together  the  cross 
spectral  components  from  4  nsec  data  segments.  Each  cross 
spectral  estimate  was  rotated  by  the  expected  change  in  the 
phase  of  its  components  based  on  an  a  priori  estimate  of  the 
phase  delay  rate.  Almost  always  the  a  priori  phase  delay  rate 

is  known  to  within  10”*1  sec/sec  and  hence  the  error  in  the 

o 

rotation  of  the  cross  spectral  components  is  bounded  by  58 
over  a  2  second  accumulaton  period  at  an  observing  frequency 
of  8  GHz.  This  error  is  not  so  severe  as  it  first  appears 
because  the  estimates  of  the  cross  spectral  components  are 
referred  to  the  epoch  at  the  center  of  the  accumulation 
period.  The  phase  error  due  to  an  error  in  the  a  priori  phase 
delay  rate  will  be  zero  for  this  epoch  if  all  of  the  data 
recorded  during  the  accumulation  period  are  used  in  computing 
the  estimates  of  the  cross  spectral  components.  All  of  the 
data  recorded  during  an  accumulation  period  may  not  be  used  to 
estimate  the  cross  spectrum,  for  example  because  of  tape 
reading  errors  during  correlation.  The  error  in  the  phase  of 
the  coherently  averaged  cross  spectrum  due  to  data  losses  will 
depend  upon  the  times  during  the  accumulation  period  at  which 
data  are  lost.  The  worst  possible  error  is  58°  when  only  a 
few  bits  of  data  from  either  the  beginning  or  the  end  of  the 
accumulation  period  are  used  to  compute  the  cross  spectrum. 
This  error  could  be  positive  or  negative  depending  upon 
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whether  or  not  the  segment  is  at  the  beginning  or  the  end  of 
the  accumulation  period.  In  general,  the  error  in  the  phase 
of  the  cross  spectral  components  could  be  positive  or  negative 
depending  upon  whether  or  not  data  are  lost  primarily  before 
or  after  the  temporal  center  of  the  accumulation  period.  A 
systematic  bias  could  develop  in  the  phase  errors  if  data  were 
lost  systematically  from  before  or  after  these  center  epochs. 
If  this  systematic  loss  of  data  occurred,  the  error  in  the 
phase  delay  would  still  be  bounded  by  the  product  of  the  phase 
delay  rate  error  and  the  duration  of  the  accumulation  period 
(for  an  error  of  10~^  sec/sec  in  the  phase  delay  rate  and  an 
accumulation  period  of  2  sec,  the  maximum  error  in  the  phase 
delay  would  be  10  psec  *0.3  cm).  If  the  same  systematic  data 
loss  occurs  in  all  frequency  channels,  then  the  maximum  group 
delay  error  would  also  be  10  psec. 

To  evaluate  the  effects  of  systematic  data  loss  in  only 
one  channel  we  can  use  the  equivalence  of  the  maximum  likeli¬ 
hood  and  weighted  least  squares  estimators  (see  Appendix  A). 
There  will  be  a  compensating  effect  when  only  one  channel 
loses  data;  the  greater  the  data  loss,  the  greater  the  phase 
error  in  that  channel  but  the  less  weight  this  channel  will 
have  in  the  estimation  of  the  group  delay.  In  Appendix  A,  we 
saw  that  the  weight  given  to  the  data  from  an  accumulation 
period  is  directly  proportional  to  the  number  of  data  segments 
used  in  evaluating  the  cross  spectral  components  for  that 
accumulation  period.  In  Figure  B.4.15,  we  show  the  errors  in 
the  estimated  group  delay  which  would  result  from  systematic 
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data  losses  in  all  accumulation  periods  for  a  given  channel 

when  the  a  priori  phase  delay  rate  is  in  error  by  10~^ 

sec/sec.  These  results  are  extreme  for  several  reasons.  We 

have  assumed  the  largest  possible  error  in  the  cross  spectral 

phase  for  a  given  percentage  loss  of  data,  i.e.  the  data  are 

lost  always  from  the  beginning  of  the  accumulation  period, 

every  accumulation  period  for  a  given  channel  was  assumed  to 

lose  exactly  the  same  amount  of  data  at  the  same  location,  and 

the  error  in  the  a  priori  phase  delay  rate  was  extreme.  More 

-12 

typical  errors  in  the  phase  delay  rate  are  10  sec/sec.  The 
curves  will  scale  linearly  with  phase  delay  rate  error.  In 
practice  if  an  accumulation  period  has  discard  rates  of 
greater  than  10%,  it  is  discarded  totally  and  will  produce  no 
error  in  the  estimates  of  the  group  delays.  The  use  of  this 
cutoff  in  the  amount  of  data  loss  in  an  accumulation  limits 
the  group  delay  error  to  20  psec  (*0.7  cm)  even  in  the  extreme 
case  studied. 

The  errors  in  the  phase  delays  which  resulted  from  these 
same  data  loses  were  all  less  than  2%  of  the  group  delay 
errors,  i.e.  <0.4  psec. 

The  use  of  upper  and  lower  sideband  data  will  not  cancel 
the  errors  given  because  for  such  errors  to  develop  requires 
loss  of  data.  These  data  losses  will,  in  general,  be 
different  for  upper  and  lower  sidebands  and  hence  the  error 
will  not  cancel. 

We  have  now  discussed  some  of  the  limitations  of  the  Mark 
III  processing  software.  There  are  other  effects  which  could 


be  discussed  which  involve  the  equipment  limitations,  e.g.  we 
have  already  seen  how  the  bandpass  filter  in  the  video  con¬ 
verters  can  affect  the  magnitude  of  an  error.  In  Chapter  3  we 
discuss  some  of  the  equipment  limitations  along  with  the 
investigation  of  the  quality  of  VLBI  measurements. 


APPENDIX  C.  Correlations  Between  VLBI  Observations 


When  three  or  more  radio  telescopes  simultaneously 
observe  the  same  radio  source,  redundant  information  is 
obtained  which  can  be  used  to  check  the  quality  of  the 
observations  (i.e.  for  a  baseline  triplet,  the  sum  of  the 
delays  referred  to  a  common  epoch  around  the  triplet  should 
ba  zero  if  there  is  no  contribution  to  the  sum  from  the 
brightness  distribution  of  the  radio  source) .  In  this 
appendix,  we  will  evaluate  the  magnitude  of  the  correlation 
between  the  VLBI  measurements  from  each  baseline  when  two 
baselines  have  a  common  station  and  simultaneously  observe 
the  same  radio  source.  The  estimated  correlations  can  be 
used  for  two  purposes.  Firstly,  if  the  observations  are 
correlated  then  we  should  account  for  this  correlation  when 
estimating  geodetic  parameters.  Secondly,  the  correlation 
should  be  accounted  for  when  we  are  computing  the  estimated 
uncertainty  of  the  sum  of  the  delays  around  a  triplet  of 
baselines . 

We  will  first  investigate  the  relationship  between  the 
cross  spectra  of  the  signals  recorded  at  each  site  when 
three  stations  simultaneously  observe  the  same  radio 
source.  The  relationship  between  the  cross  spectra  will 
allow  us  to  deduce  the  correlation  between  the  phases  of 
the  components  of  each  cross  spectrum.  Then,  using  the 
relationship  between  the  maximization  of  the  delay  resolu¬ 
tion  function  and  a  least  squares  approach,  established  in 
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Appendix  A,  we  will  compute  the  correlation  between  the 
observed  values  of  the  group  delays,  the  phase  delay  rates 
and  the  phase  delays  on  two  baselines  which  have  a  common 
station. 

Three  radio  telescopes  are  assumed  to  be  observing  the 
same  point  radio  source.  At  each  station  random  signals 
from  the  source  and  random  noise  from  the  receiver  and 
surroundings  are  being  recorded.  In  the  frequency  domain, 
the  spectrum  of  the  signals  recorded  at  each  site  will  be 

=  /Ti^  S1(u)  +  /Ts^  ^(w)  (C.l.a) 

X2(u>)  =  /Taj  S2(w)  +  /TsJ  B2(uj)  (C.l.b) 

X3(u,)  =  /Taj  S3(u)  +  /TsJ  S3(o,)  (C.l.c) 

where  /Taj  (u>)  ,/fa2  S2(u>)  and  /Taj  §3(a>)  are  the  spectra 
of  the  signals  from  the  radio  source  at  the  output  of  the 
receiver;  /TsJ  N^u),  /TsJ  N2(w)  and  /TIJ  fi3(w)  are  the 
spectra  of  the  random  additive  noise;  and  w  is  the  angular 
frequency  (see  Appendix  A  for  details) . 

In  Appendix  A,  we  found  that  the  spectra  of  the 
signals  from  the  radio  source  are  related  by 

32(co)  =  S1<w)e“1*12(w)  (C.2.a) 

§3(w)  *  §1(w)e"1<>13(u)  (C.2.b) 

where  $32(“)  and  4>^3(w)  ate  the  pairwise  phase  differences 
between  the  spectra  of  the  signals  recorded  at  each  of  the 
two  pairs  of  sites.  These  phase  differences  account  for 
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the  difference  in  arrival  times  of  the  signals  at  each  of 
the  sites.  We  can  obtain  estimates  of  these  phase  differ¬ 
ences  from  the  phases  of  the  cross  spectra  of  the  data 
recorded  at  each  site.  These  estimates  will  differ  from 
the  "true"  phase  differences  due  to  the  presence  of  the 
random  noise  ^  Ts^  (w)  ,/Ts2  N2(w)  and  s3  (<*>). 

The  cross  spectra  of  the  data  from  the  interferometers 
formed  by  Sites  1  and  2  and  Sites  1  and  3,  C12(u)  and 
C13  (w)  ,  are 

C12(u))  *  X*(o>)X2(g)) 

and 

C13(U)  *  X*(w)x3(w) 

The  expansion  of  these  cross  spectra  in  terms  of  the  signal 
and  noise  spectra  is 

Ci2(w)  *  /Ta1Ta2  S^S^e-**^  +  /Ta^Ts^  Sjfij 

+  ✓Ts1Ta2  N*S1e"1^12  +  ✓Ts^Ts^  nJn2  (C.3.a) 

C13(u>)  *  /Ta1Ta3  S^S^e” 1  ^13  +  /Ta^Sj  S*N3 

+  /TsxTa3  +  /Ts^TaJ  N*N3  (C.3.b) 

where  we  have  directly  substituted  Equations  (C.2)  into  the 
expressions  for  cross  spectra.  We  have  not  shown  the 
frequency  dependence  on  the  right  hand  side  of  the  equa¬ 
tions  . 

The  first  term  in  each  of  Equations  (C.3)  is  the 
signal  term.  The  remaining  terms  are  noise  terms.  There 


will  be  a  correlation  between  the  cross  spectra  if  they 
have  common  noise  terms.  There  is  one  common  noise  term 
which  arises  from  the  product  of  the  signal  spectra  and  the 
noise  spectra  at  the  common  site  (in  this  case  Site  1). 

The  noise  spectra  at  Site  1,  R^,  also  appears  in  another 
term  in  each  cross  spectrum,  /Ts^Ts2  R^R2  and  /Ts^Ts^j  R^R3, 
but  these  terms  will  not  introduce  any  correlation  because 
they  are  independent.  This  independence  is  easily  shown  by 
using  the  expansion  of  the  products  of  four  zero  mean 
Gaussian  variables  (Davenport  and  Root,  1958): 

<bJb2(H*R3)*>  *  <S*R2><B1H3>  +  <R*R1><B2!i3>  ♦  ^R^xrJr^, 

where  <>  denotes  expectation.  The  expectation  of  each  of 
the  products  in  the  expansion  is  zero  (at  least  one  pair  in 
each  term  is  composed  of  independent  factors) ,  and  thus 
there  will  be  no  correlation  between  the  cross  spectra  due 
to  these  terms.  By  the  same  arguments,  the  terms 
containing  the  signal  and  the  noise  at  the  remote  sites 
will  not  introduce  any  correlation  between  the  cross 
spectral  estimates. 

Of  the  three  noise  terms  in  the  cross  spectra  of  the 
signals  recorded  at  each  site,  only  one  is  common  to  cross 
spectra  from  the  two  baselines.  The  common  terms  from  the 
two  baselines  are  /Fs^Ta^  R^3^e“**12  and  /Ts^Ta3 
RlSie-1<^13.  These  two  terms  are  deterministically  related. 
They  differ  in  amplitude  by  /Ta2/Ta3  and  in  argument  by 
e-1 ( ^12“^13^ .  An  interesting  point  and  one  which  will  be 
very  useful  later,  is  that  the  angle  between  this  common 


term  and  the  signal  part  of  the  cross  spectrum  is  the  same 


for  both  cross  spectra.  This  angle  is  equal  to  the  argument 


of 


At  this  stage  we  will  group  the  two  independent  terms 
in  each  cross  spectrum  in  Equations  (C.3)  together  which 
yields 
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C12(w)  =  /Ta1Ta2  S^e"  *  ^  12  +  /Ts1Ta2  N*Sje-1*12  +  I12 


(C.4.a) 


and 


—  *  —  _  i  / 


C13(o>)  =  /Ta1Ta;}  S^e  *'*'13  +  /TSjTa-j  N*§1e"1,t,13  +  I13 


(C.4.b) 


where 


1 12  *  /Ta1Ts2  S*N2  +  /TSjTSj  N*N2 


and 


I13  =  •'Ta xTs3  3*n3  +  /fs^TSj  R^R. 


In  Appendix  A  we  saw  that  the  individual  estimates  of 
the  cross  spectral  components  are  not  used  to  directly 
estimate  the  values  of  the  VLBI  observables.  Coherent 
averages  of  the  cross  spectral  estimates  are  used.  The 
cross  spectral  components,  C*2(w),  averaged  over  an 
accumulation  period,  are  computed  from  (see  Appendix  A, 
Equation  (A. 15)  for  details) 

C12(u)  »nlq)^ii(“)X2i(«)ei(aap)  j 
j*l  J  J 

where  (a._)j  is  an  estimate  of  the  change  in  phase  between 
ap  j 

the  signal  part  of  the  cross  spectral  components  at  epoch  j 
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and  the  signal  part  at  the  first  epoch  in  the  accumulation 
period;  n(q)  is  the  number  of  estimates  of  the  cross 
spectral  components  in  the  qth  accumulation  period. 

If  (<*ap)j  is  efficiently  close  to  the  actual  change 

in  phase  of  the  signal  part  of  the  cross  spectrum,  then  the 

signal  part  of  the  cross  spectrum  counter-rotated  by  (ot  )  . 

ap  3 

will  have  a  constant  phase  and  the  magnitude  of  the  sum  of 
these  cross  spectra  will  increase  in  proportion  to  the 
number  of  segments  summed.  The  noise  component  of  the 
cross  spectrum  will  have  randomly  distributed  phases  and 
their  sum  will  increase  in  proportion  to  the  square  root  of 
the  number  of  segments  summed.  In  Table  C.l  we  summarise 
the  statistics  of  the  terms  in  the  cross  spectra  after  n(q) 
estimates  of  the  cross  spectral  components  have  been 
summed.  Also  in  Table  C.l  we  define  the  symbols  to  be  used 
to  denote  the  coherently  summed  cross-spectral  components. 
Figure  (C.l)  gives  a  graphical  representation  of  the 
components  of  the  averaged  cross  spectra.  We  should  note 
that  the  definition  of  the  averaged  cross  spectrum  includes 
the  antenna  and  system  temperatures.  For  this  reason, 
equations  which  contain  the  averaged  cross-spectral 
components  do  not  explicitly  contain  these  temperatures. 
This  approach  was  adopted  to  simplify  the  expressions  for 
the  phase  errors. 

By  combining  the  statistics  given  in  Table  C.l  with 
the  geometry  of  Figure  (C.l)  we  may  obtain  estimates  of  the 
statistics  of  the  phase  error  of  each  averaged  cross  spec- 


Figure  Crl  Graphical  representation  of  the  components  of  the  averaged 

•“3  -  a 

cross  spectra  and  ci3(w) 

Imaginary 
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tcal  component.  The  phase  errors  for  each  baseline  are 
denoted  in  Figure  (C.2)  by  an<*  ei3*  By  treating 
complex  numbers  as  vectors  in  the  complex  plane  and  assum¬ 
ing  that  the  noise  terms  are  small  compared  to  the  signal 
term,  we  can  write  these  phase  errors  as 

e  12  *  [I  i2*  ®i2  +  (Ni*S^e_i^i2)‘e12]/|g|*S^|  (C.5.a) 

and 

e i3  »  IH3*®i3  +  (Ni*S^e"l4,i3)«e13]/|S{*S^|  (C.5.b) 

where  e12  and  §i3  are  unit  vectors  perpendicular  to  the 
signal  vector  (see  Figure  (C.2));  $|2  and  *^3  are  the  cross 
spectral  phases  for  the  central  accumulation  period.  These 
phases  account  for  the  errors  on  the  a  priori  models  for 
the  total  phase  and  the  phase-delay  rate,  i.e.  the  phase  of 
the  averaged  cross  spectral  components  will  not  be  zero,  in 
general,  even  when  there  is  no  noise  present  because  of 
errors  in  the  a  priori  models  which  are  used  to  compute 

v  ap';j 

The  variances  of  ei2  and  e33  may  now  be  calculated 
from  Equation  (C.5)  and  the  statistics  given  in  Table  C.l. 
The  variances  will  be  given  by 
2 

<e 12>  *  (TSiTs2  +  TaiTs2  +  Ta2TSi) / (Ta iTa2n (q) ) 

and 

2 

<ei3>  *  (Ts^Ts3  +  Ta^Ts3  +  Ta3TSi) /(TaiTa3n(q) ) 

These  expressions  for  the  variances  may  be  simplified 
by  realizing  that  for  most  geodetic  experiments,  the 


antenna  temperatures  are  much  less  than  the  system 
temperatures.  If  we  assume  that  the  product  of  the  antenna 
and  system  temperatures  is  much  less  than  the  product  of 
the  system  temperatures,  the  expressions  for  the  variances 


reduce  to 


<ej2  >  *  Ts1Ts2/(Ta1Ta2n(q)  ) 
2 

and  <ei3  >  *  Ts^Ts^/fTa^Ta^nfq) ) 


(C.6.a) 


(C.6.b) 


The  final  expression  we  will  need  before  computing  the 
correlation  between  the  errors  in  the  estimates  of  the 
phase  of  the  cross-spectral  component  is  the  covariance 
between  the  phase  errors,  <ei2el3>*  This  covariance  will 
be  given  by 

<E12ei3>  = 

**12**12  *13*13*  +  <  ®2®  *  12#§12)  (S1  **3®  * 


lSi*S2llSi*S3l 


The  expectation  of  the  product  of  the  independent  noise 
terms  is  zero.  (This  result  was  shown  earlier  in  this 
Appendix.)  The  expectation  of  the  second  term  can  be 

_  *_  _ i a 

evaluated  by  noting  that  the  angles  between  N|  S£e  12  and 
§12  and  NJ  S^e  ^13  and  are  identically  equal.  Wfe  will 
denote  this  angular  difference  with  the  symbol  &  (see 
Figure  (C.2)).  This  equality  allows  the  expectation  of  the 
product  of  the  correlated  noise  terms  to  be  reduced  to 

<(f3'*5^e"i$12'«12)  (8i*S^e_i*13‘613)>  - 

<Re{|S'*S'  |ei4, }  Re{|fl'*S'  |eU}> 


—  J  7  J- 


which  may  be  evaluated  using  the  statistics  given  in  Table 
C.l.  The  covariance  between  the  phase  errors  will  be 

<e12e13>  =  Tsl/  [Tain<q>]  (C.7) 

The  combination  of  Equations  (C.6)  and  (C.7)  allows  the 
correlation  between  the  errors  in  the  estimates  of  the 
phases  of  the  cross-spectral  components  to  be  evaluated. 
This  correlation  will  be 


pe  *  <e12e13>/  ^<E12><e13>  *  /Ta2Ta3/Ts2Ts3  (C.8) 

Equation  (C.8)  gives  the  correlation  between  the  phase 
errors  of  the  summed  cross  spectral  components  for  a  single 
accumulation  period.  The  equivalence  of  maximizing  the 
delay  resolution  function  and  least  squares  can  now  be  used 
to  determine  the  correlation  between  the  errors  in  the 
multiband  group  delay,  phase-delay  rate  and  phase  delay 
estimates  for  the  pair  of  baselines. 

We  will  estimate  these  correlations  using  a  general 
approach.  If  we  treat  the  estimation  of  the  values  of  the 
VLB  I  observables  as  a  least  squares  problem,  we  can  express 
the  group  delay,  phase-delay  rate  and  phase  delay  as  a 
linear  combination  of  the  phases  of  the  cross-spectral 
components  from  each  accumulation  period.  We  may  write 
these  relationships  for  each  baseline  as 


P 


12  " 


l  aj(*12>j 


(C. 9.a) 


(C.9.b) 


and 

*13  =  l  “j(*13)j 

where  p12  and  p^  are  estimated  values  of  the  VLBI  observ¬ 
ables  (i.e.  multiband  group  delay  or  phase  delay  or  phase 
delay  rate),  are  the  linear  coefficients  which  relate 
the  coherently  averaged  cross-spectral  phases  to  the  param¬ 
eter  of  interest,  and  ( ♦  ^  2 )  j  an<*  (♦13)  j  are  cross-spectral 
phases  from  each  accumulation  period  on  each  of  the  base¬ 
lines.  The  summation  will  be  over  all  accumulation 
periods. 

The  form  of  Equations  (C.9)  implies  that  exactly  the 
same  linear  relationship  is  used  to  determine  the  values  of 
the  observables  from  the  cross-spectral  phases  from  each 
baseline.  This  is  not  exactly  correct  because  some  accum¬ 
ulation  periods  may  not  be  used  to  estimate  £^2  and  £33  on 
either  one  or  both  of  the  baselines  due  to  tape  reading 
errors.  We  will  investigate  the  effects  of  this  possibil¬ 
ity  shortly. 

Equations  (C.9)  may  be  used  to  determine  the  correla¬ 
tion  between  the  errors  in  the  estimates  of  £12  and  £33* 
This  correlation  will  be  given  by 


pp  =  <e 


P12  P13 


><e: 


'12 


'13 


which  reduces  to 
2 


PP  =  j° j<e12 je13j>/^(  j<e12j>)  (?® j<e13j>) 


(C.10) 


where  we  have  assumed  that  the  expectation  of  the  product 
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terms  from  different  accumulation  periods  is  zero.  (This 


assumption  is  consistent  with  the  statistical  assumptions 
made  in  Chapter  2  for  the  properties  of  the  signal  and 
noise  recorded  at  each  site.)  If  we  assume  that  the 
expectations  of  the  phase  error  products  are  independent  of 


j,  Equation  (C.10)  reduces  to 


"  <£12£13>//<ei2><ei3>  =  pt 


Therefore,  the  correlations  between  the  errors  in  the 
estimates  of  the  group  delays  for  a  pair  of  baselines  with 
a  common  station  will  be 

Px  =  >/'Ta2Ta3/Ts2Ts3  (C.ll) 

g 

Similarly,  the  correlations  between  the  errors  in  the 
estimates  of  the  phase-delay  rates  and  the  phase  delays 
from  each  baseline  will  be  given  by  the  same  expression. 

If  the  set  of  accumulation  periods  used  to  estimate 
the  values  of  the  observables  is  different  for  each  of  the 
baselines  the  correlation  given  by  Equation  (C.ll)  will  be 
reduced.  When  some  accumulation  periods  are  not  used  the 
variance  of  the  estimated  parameter  will  increase  while  the 
covariance  (2a  -<  (e  ^)  j  (E  13)  j>)  will  decrease  because  there 
are  less  terms  in  the  summation.  Hence,  Equation  (C.ll) 
places  an  upper  bound  on  the  correlation. 

Before  completing  this  appendix  we  will  evaluate  the 
effects  of  the  correlation  on  the  estimated  standard  devia¬ 
tion  of  a  baseline  triplet  closure  uncertainty.  We  will 


consider  a  simple  case  for  which  the  antenna  and  system 
temperatures  are  the  same  for  each  station  in  the  triplet 
and  exactly  the  same  accumulation  periods  are  used  for  each 
observation.  For  this  case  it  can  be  easily  shown  that  the 
variance  of  the  closure  quantity  (group  delay,  phase-delay 
rate  or  phase  delay)  will  be  over-estimated  by  six  times 
the  covariance  of  the  baseline  quantity  if  the  correlation 
between  baseline  quantities  is  not  taken  into  account. 

(The  variance  of  the  sum  of  the  delays  around  the  triplet 

2  2 
will  be  Cj,  =  <  <£i2  +  e23  +  e31^  >  * 

<ei2>+<e31>+<e31>+2<ei2e23>+2<e23e31>+2<ei2E31>*  For  the 
case  given  the  expectations  of  the  cross  terms  would  be 

2  2  2  2 

equal  and  therefore  o£  *  <T12+°l3  +  °31+6<e12e23>*  ^ 

TYp i cal ly  values  of  the  correlation  between  the  errors 
in  the  values  of  the  VLBI  observables  are  less  than  0.01, 
i.e.  typically  /Ta2Ta3/Ts2Ts3  is  less  than  0.01.  The  above 
result  would  indicate  that  neglecting  correlations  while 
computing  closure  statistics  would  introduce  errors  in 
these  statistics  of  less  than  1%. 

The  correlations  between  errors  in  the  group  delay, 
phase  delay  and  phase  delay  rate  estimates  can  be  easily 
accounted  for  when  closure  statistics  are  computed.  At 
this  time,  accounting  for  these  correlations  in  the 
weighted  least  squares  solutions  for  geodetic  parameters 
does  not  seem  warranted  given  the  uncertainty  in  the  appro¬ 
priate  statistics  for  the  group  delay  and  phase-delay  rate 
measurements  used  in  these  weighted  least  squares  solu- 


tions.  If  antenna  and  system  temperatures  improve 
considerably  (such  that  correlations  of  10%  are  likely),  it 
may  appear  that  the  correlation  between  measurements 
discussed  in  this  appendix  could  become  important. 

However,  if  this  situation  were  to  arise,  the  baseline 
length  measurements  would  be  so  accurate  (standard 
deviations  of  severval  microns,  probably),  that  we  could 
still  neglect  the  correlations.  [Remember,  that  for  the 
correlations,  due  to  random  noise,  to  be  important,  all 
other  noise  sources  in  the  analysis  must  be  small  compared 
to  this  random  noise.  With  the  current  receivers  and 
antennas,  the  standard  deviations  of  baseline  length  deter¬ 
minations  would  be  a  few  millimeters ,  if  these  determina¬ 
tions  were  signal-to-noise  ratio  limited.  (This  is  with 
less  than  1%  correlations  between  measurements.)  If  the 
sytem  temperatures  were  to  improve  tenfold,  then  baseline 
lengths  could  be  measured  with  submillimeter  accuracy 
(again  if  the  analysis  were  random  noise  limited). 

However,  with  these  improvements  in  system  temperatures,  we 
could  use  the  group-delay  measurements  to  predict  the  phase 
delay  ambiguities,  thus  decreasing  the  standard  deviations 
of  the  baseline  length  estimates  by  40  fold  (to  the  tens  of 
microns  level) .  If  the  analysis  was  still  signal-to-noise 
ratio  limited  at  this  stage,  the  correlations  could  be 
important.  Of  course,  it  is  very  unlikely  we  will  ever 


reach  this  stage.] 


In  this  appendix  we  will  develop  the  algorithms  for 
correcting  the  visibility  phases  for  the  effects  of  feed  horn 


rotation.  At  first  glance  it  is  not  obvious  why  the  orienta¬ 
tion  of  the  feed  horn  should  affect  these  phases.  Externally, 
feed  horns  appear  to  be  very  symmetrical  i.e.,  usually  conical 
in  shape.  However,  at  the  base  of  all  feed  horns  there  is  a 
crossed  dipole  or  an  equivalent  component.  We  will  first  show 
why  the  correction  is  necessary  by  considering  the  output  of  a 
pair  of  crossed  dipole  antennas  in  the  presence  of  a  propaga¬ 
ting  plane  wave.  We  will  then  develop  the  algorithms  for  an 
antenna  located  on  a  rotating  earth. 

The  electric  field  E  of  a  monochromatic  circularly  polar¬ 
ized  plane  wave  at  position  x  and  time  t  is  given  by  (Jackson, 
1975,  pp.  273-278) 


E(x,t)  =  EQ(e1±ie2) 


gi (k*  x-ut) 


where  Eq  is  the  amplitude  of  the  wave  (volts/m),  and  e2  are 
orthogonal  unit  vectors  (see  Figure  D.l)  ,  Jc  is  the  wave  vector 


of  the  wave  (directed  in  the  direction  of  propagation)  and  w 
is  the  frequency  (radians/sec)  of  the  wave.  The  upper  sign  is 
for  right  circularly  polarized  (RCP)  radiation  (IEEE  defini¬ 


tion,  i.e.  for  an  observer  looking  in  the  direction  of  propa¬ 
gation,  the  E  field  vector  at  a  fixed  location  appears  to 
rotate  clockwise) ;  the  lower  sign  is  for  left  circularly 


polarization  (LCP) .  (The  use  of  complex  numbers  to  represent 


Figure  D. 1  Geometry  of  dipole  and  field  unit  vectors 
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the  electric  field  is  a  mathematical  convenience.  The  actual 
field  quantities  can  be  obtained  from  the  complex  field 
quantities  by  taking  their  real  part.) 

To  detect  this  wave  we  need  a  method  for  converting  the 
spatially  propagating  wave  into  a  guided  wave  propagating 
along  a  transmission  line.  A  simple  method  to  detect  the 
propagating  wave  is  to  use  a  dipole  antenna.  The  voltage 
difference  between  the  terminals  of  the  dipole,  when  the 

A 

dipole  is  oriented  in  direction  d.  ,  will  be  (see  Figure  D.l) 


Vx(t)  =  ReficE^’  ( €1± i e2 )  e1  (R*xd"Wt)  } 

where  <  is  a  constant  (with  units  of  length)  which  will  be  a 
function  of  the  dimensions  of  the  dipole,  the  conductivity  of 
the  dipole  and  the  shape  of  the  E  field  in  the  region  around 
the  dipole  antenna;  and  x^  is  the  position  of  the  center  of 
the  dipole.  (The  fields  in  the  region  close  to  the  dipole  can 
not  be  plane,  in  general,  because  the  tangential  components  of 
the  electric  field  must  be  zero  on  the  surface  of  the  (per¬ 
fectly  conducting)  dipole.  For  this  discussion  we  do  not  need 
to  know  the  exact  functional  relationship  between  these 
quantities  and  k . )  The  above  equation  expresses  that  the 
component  of  the  electric  field  parallel  to  the  dipole  is 
measured.  (The  equation  is  strictly  valid  only  for  dipoles 
whose  length  is  small  compared  to  the  wavelength  of  the  radi¬ 
ation  (such  that  the  dipole  will  not  disturb  the  electric 
field  away  from  its  surface.  In  practice,  dipoles  can  have 
lengths  comparable  to  the  wavelengths  of  the  radiation 


•  •  /-  *'•  *'•  -  •  *  •  .*•  .  •  .  -  ;*♦  > 


(Jackson,  1975,  pp.  273-278).) 

In  order  to  detect  only  one  circular  polarization  the 

A  A 

output  of  a  second  dipole,  d2 ,  which  is  orthogonal  to  d^,  is 

*  o 

added  to  the  output  of  d^  with  a  +90  phase  shift  (Figure 

A 

D.l).  If  only  RCP  is  desired,  the  signal  from  6^  is  rotated 

o  * 

by  -90  ,  and  added  to  the  signal  from  d^.  Hence,  the  summed 

output  will  be 

V ( t )  =  Re{KEo[d1* (E1±i£2)-id2- (c1±ie2) le1 (R*xd-ut) }  (D.l) 

which  reduces  to 

V  ( t)  =  Re{<EQd1.  (e1  +  ie2)el(5'*Xd‘a,t)  }  (D.2.a) 

A 

when  E  ( x , t)  =  Eq ( e +  i e 2) e ^ k * X  ,i.e.  for  RCP  radiation, 

and 

V(t)  =  0  (D.2.b) 

A 

when  E(x,t)  =  E0(e^-ie2)e*  ^'X-U)t^  ,i.e.  for  LCP  radiation. 

We  may  further  simplify  Equation  (D.2.a)  by  substituting 
e  =  dL»  ( e1+ 1 e 2 ) 

where  4»,  the  feed  position  (or  parrallatic)  angle,  by 

_  1  * 

definition  equals  tan  (d^#  ^2^1*  ^1>  *  w*fch  this  substitution. 
Equation  (D.2.a)  becomes 

V(t)  =  RejxE^1  (R’Xd  +  l!'_U)t)  }  .  (D .  3 .  a) 

A  A 

When  the  signal  from  d2  is  added  to  the  signal  from  d^  with  a 
+90°  phase  shift  and  LCP  radiation  is  observed,  the  output  of 
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the  crossed  dipoles  will  be 

V(t)  =  RejicE^1  (^*Xd_<ij'“t)  }  .  (D.3.b) 

The  sense  of  angle  4>  is  shown  in  Figure  D.l.  Equations  (D.3) 
clearly  indicate  that  the  phase  of  V(t)  is  not  only  a  function 
of  the  separation  of  the  source  and  the  dipole,  but  also  a 
function  of  the  orientation  of  the  dipole. 

We  will  now  investigate  a  method  for  correcting  the 
visibility  phase  for  the  effects  of  feed  rotation.  To  be  able 

A 

to  calculate  the  feed  position  angle  we  need  to  express  d^ 
and  ^  in  a  common  coordinate  system.  As  with  most  problems 
the  ease  of  finding  a  solution  is  a  function  of  choosing  an 
appropriate  coordinate  system  and  set  of  unit  vectors.  We 
will  choose  the  vector  such  that  it  lies  in  the  plane 
containing  the  rotation  axis  of  the  earth  and  the  radio 
source,  and  is  pointed  north  (Figure  D.2).  (By  definition,  Ej 
is  orthogonal  to  E  and  hence  its  orientation  is  uniquely 
determined  except  for  sources  at  the  pole.)  For  sources  at 
the  pole  we  can  define  ^  to  point  in  any  well  defined 
direction,  e.g.  toward  the  equinox  of  date.  We  discuss  this 
special  case  after  we  have  formulated  the  general  case.) 

A 

The  d^  vector  will  be  defined  such  that  it  lies  in  the 
plane  containing  the  primary  axis  of  the  telescope  and  the 
source.  The  primary  axis  of  a  telescope  is  the  axis  whose 
direction  always  remains  fixed  with  respect  to  a  local  (earth 
fixed)  coordinate  system.  (For  azimuth/elevation  (az/el) 
mounts,  this  axis  is  in  the  direction  of  the  local  vertical. 


and  for  equatorial  mounts  this  axis  is  parallel  to  the 
rotation  axis  of  the  earth,  if  the  telescope  is  located  at  the 

A 

latitude  for  which  it  was  designed.)  The  orientation  of  d^  is 
uniquely  defined,  to  within  a  sign,  by  making  it  orthogonal  to 
the  direction  in  which  the  telescope  is  pointing  (assumed  to 
be  in  the  direction  of  the  radio  source),  (Figure  D.2).  (The 

A 

definition  of  d^  degenerates  for  az/el  telescopes  when  the 

radio  source  is  at  zenith.  In  this  case,  the  feed  rotation 

correction  cannot  be  calculated  because  the  radio  telescope 

can  be  pointed  in  any  azimuth  and  still  observe  the  radio 

source.  In  practice,  this  situation  does  not  occur  because 

the  antenna  pointing  programs  also  fail  in  this  situation.  In 

all  Mark  III  experiments  that  we  will  be  analyzing  there  are 

o 

no  observations  within  2  of  zenith  at  az/el  telescopes.  (The 
Mark  III  scheduling  program,  SKED,  will  not  allow  observations 
to  be  scheduled  at  az/el  telescopes  within  this  limit.) 

We  have  now  defined  a  convenient  set  of  unit  vectors,  but 
in  different  coordinate  systems.  To  find  the  relationship 
between  the  unit  vectors  we  may  project  them  onto  the 
celestial  sphere.  In  Figure  (D.3)  we  show  such  a  projection 

A 

of  the  unit  vectors  and  d^.  The  unit  vector  lies  in  the 
plane  of  the  rotation  axis  of  the  earth  and  the  radio  source, 
and  will  project  onto  the  great  circle  connecting  the  source 
and  the  pole  of  the  rotation  axis  of  the  earth.  The  unit 

A 

vector  d^  lies  in  the  plane  of  the  principal  axis  of  the 
telescope  and  the  source,  and  will  project  onto  the  great 
circle  between  the  source  and  the  pole  of  the  principal  axis 


Figure  D.3  Projection  of  the  source  and  feed  horn  unit  vectors  onto 
the  celestial  sphere 


Legend 

T  is  the  extension  onto  the  celestial  sphere  of  the  line  from  the  center  of 

the  earth  to  the  radio  telescope 

S  is  the  position  of  the  radio  source  on  the  celestial  sphere 

P  is  the  extension  of  the  Earth's  north  pole  onto  the  celestial  sphere 

A  is  the  extension  of  the  radio  telescope's  principal  axis  onto  the  celestial 

sphere 

are  the  longitude  and  latitude  of  the  radio  telescope 
9^,6^  are  the  right  ascension  and  declination,  of  date,  of  the  radio  source 
CST  is  apparent  Greenwich  sidereal  time 

ha  is  the  local  hour  angle  or  the  radio  source  IHA«GST+Xt-«g) 

is  the  xenith  angle  of  the  primary  axis  of  the  radio  telescope 
is  the  xenith  distance  of  the  radio  source 
is  the  feed  position  angle 
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of  the  telescope.  Figure  (D.3)  has  been  drawn  for  the  case 

when  the  principal  axis  of  the  telescope  lies  in  the  north- 

south  plane  at  the  site  and  is  inclined  by  angle  6  with 

respect  to  zenith.  If  the  telescope  is  an  azimith/elevation 

mount,  then  0  =0.  If  the  telescope  is  an  equatorial  mount, 

then  0  =x/2-<J>  ,  where  is  the  latitude  of  the  radio 
a  I  r 

telescope.  The  case  drawn  corresponds  to  an  equatorial  mount 
that  has  been  moved  from  the  latitude  for  which  it  was 
designed.  This  situation  will  apply  to  the  Richmond,  Florida, 
antenna  which  will  form  part  of  the  POLARIS  (POLar  motion 
Analysis  by  Radio  Inter f erometr y  Surveying)  network  (Carter, 
1979).  (The  primary  axis  of  the  Richmond  antenna  will  be 
placed  in  the  local  meridian  (Robertson,  1982;  private 
communication) ,  and  we  have  drawn  Figure  D.3  assuming  that 
this  will  be  the  case.) 

The  feed  angle  <i>  can  be  easily  computed  from  the 
spherical  triangle  PAS  (see  Figure  D.3) .  The  angle  between 
the  projection  of  the  principal  axis  of  the  telescope  and  the 
source  may  be  computed  using  the  law  of  cosines  for  spherical 
triangles.  Application  of  this  law  yields 

cos(z,)  =  sin (4>  +8  )  sin$  +  cos($_+9  )cos6  cosHA  (D.4) 
where  the  angles  are  defined  in  Figure  (D.3).  We  could 
determine  <1*  using  the  law  of  sines  but  care  would  be  needed  in 
resolving  the  correct  quadrant  of  the  inverse  sine.  (This 
approach  was  adopted  by  Knight,  1981,  Appendix  3,  although  the 
logic  used  to  derive  the  correction  was  somewhat  different 
from  that  presented  in  this  appendix.)  The  quadrant  problem 


can  be  resolved  by  using  the  half  angle  formula  (see,  for 
example,  Breyer,  1978;  pp.  175-178).  Thus, 

sin(s-V2+6_)sin(s-z  )  1/2 

tan  (4</2)  =  sign(HA)[ - 2 - — ]  (D.5) 

sin ( s-u/2+$  +9_ )  sin(s) 
r  3 

where  sign (HA)  is  the  sign  of  the  hour  angle  expressed  as  an 
angle  between  -n  and  it  (the  positive  value  of  the  square  root 
should  be  taken) ,  and  s  is  half  the  sum  of  the  sides  of  the 
spherical  triangle  PAS,  i.e.,  s  *  ((it/2-(4>  +  9  )  )  +  (n/2-6  )  + 

IT  3  S 

z  )/2.  Application  of  Equation  (D.5)  leads  to  a  unique 
determination  of  4>. 

When  6  =(tt/2)-$  ,  the  principal  axis  of  the  telescope  is 
parallel  to  the  rotation  axis  of  the  earth,  i.e.  A  and  P  in 
Figure  D.3  will  coincide.  In  this  case  zg  will  equal  (it/2)-6s 
and  s  will  reduce  to  (it/2)-6s.  Substitution  of  these  values 
into  Equation  (D.5)  shows  4»  will  be  zero  for  all  observations. 

We  may  now  write  down  the  algorithm  for  the  correction 
for  feed  rotation.  The  visibilty  phase  is  the  phase  of  the 
signal  at  Site  2  minus  the  phase  at  Site  1  at  the  same  epoch 
(see  Chapter  2  for  discussion).  From  Equation  (D.3. a)  we 
deduce  this  difference  in  phase,  for  RCP  (the  polarization 

used  in  the  Mark  III  experiments  that  we  will  be  studying): 

♦t  =  (E*x2+c i>2)  -  (Jc*;^*^) 

where  x2  and  x^  are  the  coordinates  of  the  dipole  at  Sites  2 
and  1,  respectively,  and  4<2  and  <1^  are  the  corresponding  feed 
angles  at  each  site.  The  visibility  phase  corrected  for  feed 


which  can  be  generated  from  the  observed  visibility  phase  by 
subtracting  the  appropriate  combinations  of  feed  angles.  For 
RCP,  we  have 

4>t  -  4>£  —  (4> 2 — ^ i )  *  (D.  6. a) 

For  LCP,  the  correction  is 

♦  ^  =  •  (D.6.b) 

We  should  note  that  <l>  is  independent  of  frequency  and  hence  no 

correction  need  be  applied  to  the  group  delays. 

In  Figure  D.4  we  plot  the  feed  rotation  correction,  for 

o 

an  interferometer  with  one  element  being  a  telescope  at  45 
latitude,  with  an  az/el  mount,  and  the  other,  a  telescope  with 
an  equatorial  mount,  at  an  arbitrary  location.  The  correction 
is  plotted  as  a  function  of  the  hour  angle  of  the  source  at 
the  az/el  telescope.  The  maximum  observing  time  at  the  az/el 
telescope  has  been  plotted  for  the  source  at  different 
declinations,  i.e.  the  limits  on  mutual  visibility  of  the 
source  from  the  two  sites  were  ignored. 

In  Figure  D.4  we  see  one  discontinuity  in  the  correction 
(for  6  =45°,  and  zero  hour  angle).  This  discontinuity  occurs 
as  the  source  passes  through  zenith  at  the  az/el  telescope. 
When  the  source  is  at  zenith,  the  radio  telescope  can  be 
pointed  in  any  azimuth  and  still  observe  the  radio  source. 

o 

The  feed  rotation  correction  can  have  any  value  between  ±180 
depending  on  the  azimuth  of  the  telescope.  In  practice,  this 
situation  does  not  occur  (see  earlier  discussion  on  the 

A 

definition  of  d^)  . 

We  mentioned  earlier  in  this  appendix  the  degeneracy  of 


source  at  az/el  mount  telescope  (hr) 
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our  definition  of  the  direction  of  when  6g=n/2.  We  may  now 
study  this  case  in  more  detail.  If  we  substitute  6  -*/2  into 

S 

Equation  (D.4)  we  see  that  z_  ■  n/2-  (4»_+0_)  and  hence  s  = 

3  I.  a 

ix/2  —  (4>r  +  ©a)  -  If  we  now  attempt  to  use  Equation  (D.5),  we  see 
that  the  equation  is  inderterminate ,  i.e.  both  s-za  and 
s-*/2  +  (j>r  +  ea  equal  zero  when  6s  =  it/2.  We  could  determine  the 
value  of  Equation  (D.5)  by  finding  the  limit  of 
sin'.j-z  )/sin(s-x/2  +  $  +0  )  as  6  +n/2 .  However,  a  simpler 

a  r  3  s 

approach  is  to  use  the  law  of  sines  to  determine  d>.  Thus, 

sin4<  *  sin(HA) cos($  +0.)  /sin(z_)  . 

r  3  3 

When  6g=Ti/2,  the  law  of  sines  yields  sin4»  *  sin  (HA)  and  hence 
0*  =  HAinit,  where  any  integral  value  of  n  (including  zero)  will 
satisfy  sin<p  =  sin(HA).  The  solution  consistent  with  Equation 
(D.5)  is  cj>  =  HA±n .  When  the  law  of  sines  is  used  to  solve  the 
problem  there  does  not  appear  to  be  any  singularity,  but  is 
undefined  for  sources  at  the  pole.  What  happened?  The  answer 
is  quite  simple,  for  sources  at  the  pole,  the  hour  angle  is 
undefined  because  the  source  may  have  any  right  ascension  (see 
Figure  D.3).  We  may  arbitarily  define  the  right  ascension  of 
the  source  and  hence  implicitly  define  the  direction  of  e^. 

(If  the  right  ascension  is  taken  to  be  zero,  will  be 
directed  toward  the  equinox  of  date.  This  may  be  shown  by 
considering  the  limit  of  the  geometry  in  Figure  D.3  as 
6g-*’x/2.)  Since  the  correction  to  the  interferometer  visibil¬ 
ity  phase  is  the  difference  between  the  feed  position  angles 
at  each  site  in  the  interferometer,  any  constant  change  in  the 
feed  position  angle  will  cancel  in  the  difference,  except  when 
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one  element  in  the  interferometer  is  an  equatorial  mount.  For 


equatorial  mounts  (at  their  design  latitudes) ,  we  saw  that  the 


feed  position  angle  was  zero  (see  earlier  discussion) .  This 


statement  is  not  correct  if  6s=x/2,  because  in  this  case  the 


radio  telescope  can  be  pointed  at  any  right  ascension  and 


still  observe  the  source.  For  reasons  similar  to  those 


discussed  for  az/el  telescopes  pointing  at  zenith,  this  situ¬ 


ation  does  not  arise  in  practice  and  hence  the  singul?..  ties 


in  the  definitions  of  unit  vectors  and  d^  should  cause  no 


problems  in  practice. 


There  are  several  assumptions  in  the  formulation  present¬ 


ed  in  this  appendix  which  are  not  strictly  valid.  We  assumed 


that  the  ^  unit  vector  was  fixed  with  respect  to  inertial 
space  by  defining  its  orientation  as  fixed  with  respect  to  the 


rotation  axis  of  the  earth.  However,  the  rotation  axis  of  the 


earth  precesses  and  nutates  in  inertial  space.  In  13,000 


years  (half  of  the  precession  period)  the  error  introduced  by 


the  motion  of  the  rotation  axis  in  inertial  space  will  reach 


its  maximum  value  of  =46  .  For  the  next  several  decades  we 


can  safely  ignore  this  error.  Similarly,  the  unit  vector  d^ 


was  referred  to  the  rotation  axis  of  the  earth  which  moves 


with  respect  to  the  crust  of  the  earth.  The  motion  of  the 


rotation  axis  with  respect  to  the  crust  of  the  earth  will  also 


introduce  some  error  in  the  formulation.  Both  of  these 


motions  of  the  rotation  axis  will  introduce  errors  of  less 


than  1  in  the  corrections  to  the  visibilty  phases  within  the 


next  several  decades,  with  the  major  error  contribution  coming 
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from  the  precession  of  the  rotation  axis  in  inertial  space. 
If  errors  of  this  magnitude  do  become  a  problem,  the  formu¬ 
lation  of  the  feed  rotation  correction  can  be  updated  to 
include  the  effects  of  precession  and  nutation. 
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In  this  appendix  we  review  the  weighted  least  squares 
estimation  technique.  We  will  not  derive  the  basic  least 
squares  algorithm  because  this  derivation  can  be  found  in 
nearly  all  books  on  estimation  theory.  We  will  instead 
concentrate  on  extensions  of  the  basic  formulation  which  allow 
large  numbers  of  parameters  to  be  estimated,  and  correlations 
to  be  accounted  for  (without  the  direct  inversion  of  large 
matrices).  We  will  also  study  some  methods  of  interpreting 
the  results  of  a  least  squares  analysis. 

The  basic  estimation  problem  which  we  wish  to  solve  is, 
given  n  observations  with  expectations  and  noise  contri¬ 
butions,  £,  such  that  £  =  Z+£>  (we  discuss  the  nature  of 

£  shortly) ,  we  wish  to  estimate  the  values  of  m  parameters,  jc. 
These  parameters  are  related  to  the  observables  by  a  mathe¬ 
matical  model  %  =  _f  (jc)  .  In  general,  jf(_x)  will  be  non-linear 
functions  and  we  will  first  linearize  the  estimation  problem 
by  using  a  Taylor's  series  expansion  of  £(jc)  about  a  priori 
values  of  the  parameters,  x  . 

With  the  first  order  Taylor's  series  expansion,  the  model 
becomes 

Z  =  +  ( }  ^21  (E.l) 

We  can  compress  the  above  equation  by  defining  the  "pre-fit" 
residuals  Ay_=  y-_f(x  )  and  the  partials  matrix  A  *  Z>f_/bx, 
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which  yields, 

A£  =  A  Ax  +  _e  (E.2) 

For  any  choice  of  Ajt,  Aj<,  we  can  calculate  the  "post-fit" 
residuals,  v,  given  by 

v  =  A£  -  A  A  5?  .  (E.3) 

If  we  know  the  statistical  properties  of  £,  i.e.  through  a 

var i ance-covar iance  matrix,  or  simply  covariance  matrix,  V£e  ■ 

<et_  >,  then  the  weighted  least  squares  estimator  is  defined  by 

T  -1 

the  estimates  of  A.x  which  minimize  v  V^v.  It  may  be  easily 
shown  that  the  weighted  least  squares  estimate  of  the  para¬ 
meter  corrections,  A)c,  is  given  by  (see,  e.g.,  Kaula,  1966, 
Chapter  5)  , 

AS  =  (ATV^A)_1  ATV~*A£  .  (E.  4 ) 

In  the  remainder  of  this  appendix,  it  will  be  convenient  to 
write  Equation  (E.4)  in  two  different  forms.  The  first  form 
uses  the  concept  of  generalized  matrix  inverses  (see,  e.g., 
Bjerhammar,  1973,  Chapter  9;  and  Bjerhammer,  1975),  and 
Equation  (E.4)  is  written  as 

Ax  =  A'A£  ( E . 5 . a) 

where  A-  is  the  generalized  inverse  of  A.  (The  generalized 
inverse  of  a  matrix  is  defined  by  A  =  AA~A.  We  should  notice 
that  if  A  is  a  non-singular  square  matrix,  the  generalized 
inverse  is  the  usual  inverse  of  matrix  A) .  We  should  note 


that  Equation  (E.5.a)  is  a  general  form  which  will  admit  all 
linear  estimators.  The  weighted  least  squares  estimator  is 
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just  one  form  of  the  (non-unique)  generalized  inverse.  We 
will  discuss  the  interpretation  of  A~  in  Section  3.1. 

The  second  form  in  which  we  will  write  Equation  (E.4)  is 


its  computational  form.  We  introduce  the  normal  equations,  N 


T  -1  T  -1 

=  A  V^^A ,  and  a  vector  5  =  A  V e£.A2*  Equation  (E.4)  then 


1 1 

reduces  to 


Ax  =  N 


(E.5.b) 


Equations  (E.5.a)  and  (E.5.b)  look  similar  but  the  sizes 
of  the  matrices  in  these  equations  are  very  different,  e.g., 
if  we  have  1000  observations,  and  100  parameters  to  be  esti¬ 


mated,  A  is  a  100x1000  matrix,  (requiring  390  kbytes  of 

-1  . 


storage),  whereas  N  is  a  100x100  (symmetric)  matrix  requir¬ 
ing  only  20  kbytes  of  storage.  Computationally,  Equation 
( E . 5 . b)  is  much  easier  to  implement  than  Equation  (E.5.a). 

The  advantages  of  Equation  (E.5.a)  will  be  discussed  shortly. 

We  now  return  to  the  investigation  of  the  differences 
between  the  pre-fit,  £,  and  post-fit  residuals,  v.  If  we 
substitute  Equation  (E.5.a)  into  Equation  (E.3),  and  use 
Equation  (E.2)  to  eliminate  A£  from  the  resulting  equation,  we 
obtain 

v  =  (I-AA-)  e  .  (E. 6 ) 


This  equation  gives  us  a  relationship  between  the  pre-fit  and 
post-fit  residuals.  It  also  allows  us  to  compute,  for  an 
error  l  in  the  observations  (e.g.,  due  to  inadequate  models  or 
instrumental  errors),  the  form  of  the  post-fit  residuals. 

When  the  form  of  £  is  known.  Equation  (E.6)  provides  a  con- 
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venient  means  of  detecting  t  in  the  post-fit  residuals. 

It  would  appear  from  Equation  (E.6)  that  we  could  esti¬ 
mate  the  errors  in  the  observations  by 

l  =  ( I -AA~ ) ~ 1  V  .  (E.7) 

Unfortunately,  the  matrix  I-AA~  is  singular  and  hence  does  not 
have  a  unique  inverse.  The  solutions  of  Equation  (E.7)  are  of 
the  form 

£  =  v  +  A6x 

where  6 jc  are  arbitrary  corrections  to  the  parameters.  It  is 
interesting  to  note  that  if  we  can  bound  j:,  then  we  could 
bound  6x,  and  hence  place  an  upper  bound  on  the  effects  of 
systematic  errors. 

We  can  estimate  from  Equation  (E.6)  the  covariance  matrix 
of  the  post-fit  residuals,  Vvv, 

Vvv  =  <™T>  *  (I-AA")TVee(I-AA")  (E .  8 ) 

which  after  some  matrix  manipulation  reduces  to 

Vvv  =  ( I-AN-1AT)  Vee.  (E.9) 

-1  T 

Ideally  the  matrix  AN  A  should  have  all  small  elements,  so 

that  the  post-fit  residuals  have  statistics  which  are  similar 

to  those  of  t,  but  this  will  not  always  be  the  case.  For 

-1  T 

large  numbers  of  observations,  I-AN  A  is  a  very  large  matrix 

and  we  currently  have  no  software  which  will  compute  this 

.  —It 

whole  matrix.  (For  1000  observations,  I-AN  A  would  require 
nearly  2  Mbytes  of  storage.) 

It  should  also  be  noted  from  Equation  (E.9)  that,  in 


general,  the  off  diagonal  elements  xn  I-AN  A  will  be  non¬ 
zero  and  hence  correlations  between  the  post-fit  residuals  do 
not  necessarily  mean  that  the  observation  noise  was  correlated 
(see,  e.g..  Section  3.1  for  discussion  of  correlations  among 
residual  phases) . 

The  estimation  technique  we  have  just  discussed  uses  a 
parameterization  of  the  observables.  There  is  another  formu¬ 
lation  of  estimation  problems  which  is  normally  not  discussed 
in  estimation  texts  because  it  is  usually  difficult  to  apply. 
This  technique  is  estimation  by  conditions.  If  m  observables 
are  functions  of  n  parameters,  then  there  must  by  m-n  condi¬ 
tions  which  can  be  imposed  on  the  observations,  e.g.,  if 
height  differences  between  points  are  measured  around  closed 
loops,  then  the  conditions  would  be  that  the  sum  of  height 
differences  between  points  around  each  closed  loop  should 
equal  zero.  (For  this  example,  we  could  also  use  a  parameter 
estimation  technique  by  expressing  the  height  differences  as 
functions  of  the  heights  of  each  point.)  For  the  height 
difference  measurements,  the  conditions  are  easily  formulated, 
but  in  many  cases,  finding  the  conditions  can  be  very  diffi¬ 
cult.  We  will  develop  this  method  because  some  problems  can 
be  solved  very  easily  with  this  approach. 

We  express  the  (linearized)  conditions  between  the 
observables  as 


0 1  =  He 


(E.10) 


where  U  is  a  matrix  of  the  condition  partial  derivatives  and 
Uo  is  a  vector  of  constants.  (For  the  height  difference  case  0 
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would  be  a  (m-n)xm  matrix  containing  ones,  minus  ones  and 
zeroes,  and  the  an  (m-n)  null  vector).  As  in  the  case  of 
the  parameter  estimation,  the  observations  are  related  to 
their  expectations  by  £  =  ^[+£.  We  now  want  to  find  estimates 
of  the  expectations  which  satisfy  the  conditions,  subject  to 

T  —  1  T 

v  V££.v  being  a  minimum,  where  v  =  To  s°lve  this  problem 

we  use  Lagrange  multipliers  i.e.  we  minimize  the  function,  E, 
given  by 

E  =  vTV~^v  -  2KT(Uv- (U^-U^) )  (E.ll) 

T 

where  2K  are  the  Lagrange  multipliers. 

A  stationary  value  of  E  will  occur  when  bE/bv  =  2V~£v  - 
T 

2U  K  =  0,  and  therefore  E  will  have  a  stationary  value  when 

v  =  veeuTH  •  (E . 12 ) 

The  Lagrange  multipliers  can  be  determined  be  substituting  Ov 
=  Eo-°£  1  nto  the  above  equation  which  yields 

K  =  (UVeeUT)”1  (Uq-U^)  .  (E.13) 

We  may  caluculate  the  post-fit  residuals  by  substituting 
Equation  (E.13)  into  Equation  (E.12).  Hence, 

v  =  V  UT(UV  UT)_1  (U  -Ov)  .  (E . 14 ) 

—  tz  tt  — o 

.  T 

The  covariance  matrix  of  v  will  be  given  by  <vv  >.  Hence, 

<vvT>  =  <veeut(uveeut)_1(uo-05) (uo-u^)T(ovEeuT)'1ovEE> 


which  reduces  to 


Vvv  -  Vee  -  VE£uT<UVteuT>~luVtt 


(E.15) 


assuming  that  <£>  =  0,  and  that  is  not  a  random  vector. 

We  should  note  that  the  estimation  using  either  para¬ 
meters  or  conditions  will  yield  identical  results,  provided 
that  the  parameterization  is  consistent  with  the  conditions. 
The  only  distinction  between  the  two  techniques  is  the  ease  of 
implementation . 

Equation  (E.14)  shows  both  the  advantages  and  disadvan¬ 
tages  of  estimation  using  conditions.  The  matrix  to  be 
inverted  in  Equation  (E.14)  has  the  dimensions  of  the  number 
of  conditions.  If  the  number  of  conditions  is  small,  then 
Equation  (E.14)  can  be  easily  solved.  If  the  number  of  condi¬ 
tions  is  large,  then  the  inversion  of  the  matrix  in  Equation 
(E.14)  is  formidable. 

One  particular  application  cf  Equation  (E.14)  which  we 
will  investigate  is  "parameter  supression."  Quite  often  after 
parameters  have  been  estimated  from  a  set  of  observations,  we 
need  to  determine  the  changes  that  would  occur  to  the 
parameter  estimates  if  a  subset  of  the  parameters  had  not  been 
estimated.  We  could,  of  course,  repeat  the  estimation  with 
these  parameters  not  estimated,  but  this  could  be  very  time 
consuming.  Estimations  by  condition  provides  an  ideal  solu¬ 


tion  to  this  problem. 

T 

We  denote  the  estimated  parameters  by  (AjHaB  )  where  we 

mi~’  — s 

have  partitioned  the  parameter  estimates  into  the  subsets;  Afc 


m 


is  the  subset  of  parameters  we  wish  to  suppress,  anc'  A I  are 
the  remaining  parameters.  Clearly,  had  A8  not  been  estimated 
then  Ajcg  would  equal  a  null  vector.  These  are  our  conditions. 
The  condition  equation,  (Equation  E.10),  becomes 
(0  I)  I  Aft  1=0 


where  I  is  a  (square)  unit  matrix  with  dimensions  of  the 
number  of  parameters  to  be  suppressed.  If  we  now  substitute 
these  conditions  into  Equation  (E.14)  with  the  covariance 
matrix  of  the  parameter  estimates  partitioned  as 
V  =  V  V 

EG  XX  XS 

T 

V  V 
xs  ss 

we  obtain 


Ax' 

— 

-V„_V~^AX_ 

x  s  s  s  —  s 

Ais 

-Ais 

(E . 16 ) 


where  Ajc'and  AxJL  are  the  corrections  to  estimates  of  the 
parameter  corrections  which  were  obtained  when  all  of  the 
parameters  were  estimated.  (Note,  that  when  A8'  are  added  to 

—5 

the  original  initial  parameter  estimates  (A8  ) ,  the  net 

correction  to  the  supressed  parameters  will  be  zero.) 

T 

The  covariance  matrix  of  (Ajc'|A$^)  may  be  calculated 


from  Equation  (E.15).  The  result  is 
Vs-  -  I  V*x-VxsV«Vxs  0  I 


(E.17) 


where  we  note  that  AJc^  has  zero  variance  because  we  assumed 


these  parameters  were  perfectly  known  when  their  estimation 
was  supressed. 


If  only  one  parameter  is  supressed.  Equations  (E.16)  and 
(E.17)  reduce  to 

A*j  =  -<pjsW  A*s  (E.18.a> 

and 

(a!?  =  (1-Pjs>  a?  (E.18.b) 


where  Ax^  is  the  estimate  of  the  jth  parameter  correction  when 


parameter  s  is  suppressed,  p-  is  the  correlation  between  the 
—  J 5 


2  2 

parameter  estimates,  a ^  and  as  are  the  variances  of  the  para- 


2 

meter  estimates,  and  (cj)  is  the  variance  of  the  new  estimate 


of  the  j  th  parameter.  These  simple  formulas  are  very  useful 
for  studying  the  interaction  between  parameters. 

We  now  return  to  Equation  (E.6)  to  study  methods  which 
can  be  used  to  solve  this  equation  when  the  normal  equations 
become  too  large  to  be  handled  in  the  main  memory  of  the 
computer.  The  technique  we  will  develop  is  called  sequential 
least  squares  (see,  e.g.,  Kaula,  1966;  Morrison,  1969).  We 
wish  to  find  an  equivalent  solution  to  Equation  (E.6)  which 
does  not  involve  inverting  the  complete  normal  equations  at 
one  time.  We  can  find  such  a  solution  provided  that  none  of 
the  observations  depend  on  all  the  parameters.  We  will 
subdivide  the  parameters  into  two  types:  global  parameters, 
which  affect  large  numbers  of  observations  (e.g.,  station 


positions  and  radio  source  positions);  and  local  parameters, 
which  affect  a  relatively  small  number  of  observations  (e.g. 


clock  and  atmosphere  polynomial  coefficients) . 

There  are  many  techniques  which  we  could  use  to  develop 
such  a  solution.  One  method  which  is  particularly  straight¬ 
forward  is  to  treat  the  parameters  as  observations.  (We  have 
already  been  doing  this;  remember,  the  group  delay  measure¬ 
ments  are  themselves  estimates  of  parameters  obtained  from 
estimating  the  intercept  and  slope  of  the  video  DC  phases;  see 
Appendix  B.) 

We  commence  by  extending  Equation  (E.2)  to  include  the 
parameters  as  observations,  which  yields 


Ay  = 

* 

Ax  + 

E 

Ax 

I 

e 

— x 

where  AX  are  a  priori  estimates  of  the  parameter  corrections 
with  errors  £x  whose  covariance  matrix  is  N”1.  If  we  now 
apply  the  estimator  Equation  (E.4)  to  the  above  equation  we 
find 

as  -  [»T  i]|v;J  o  h»|[at  i]|v;J  o  u&pi 

I  o  Mg  I !  1 1  I  »  »,IU*I 

which  reduces  to 

Ax  =  (ATV‘*A  +  Ng)“1(ATV^Ay  +  NgAX)  (E.20) 

Equation  (E.20)  is  the  sequential  least  squares  estimator. 

This  equation  is  very  easily  implemented  into  a  standard  least 

squares  program  because  the  normal  equations  are  formed 

exactly  the  same  way  as  they  normally  would  be  formed 
T  1 

(A  V-  A)  and  then,  before  inversion,  the  inverse  of  the  a 


priori  covariance  matrix  is  added  to  them.  Similarly,  the 

T  -1 

correction  vector  (h  V£E;Ay)  is  calculated  as  it  normally  would 
be  calculated,  and  then  the  a  priori  correction  vector  (Ng&x) 
is  added.  When  the  solution  given  by  Equation  (E.20)  is 
completed,  the  covariance  matrix  of  the  global  parameters  are 
"stripped"  from  the  inverse  of  the  normal  equations  to  yield 
an  updated  global  parameter  covariance  matrix  which  can  be 
•’.sed  when  the  next  set  of  data  is  to  be  included  in  the 
solut ion . 

Once  all  of  the  data  have  been  used  in  the  sequential  or 
global  solution,  the  estimates  of  the  global  parameters  are 
the  same  as  those  estimates  which  would  have  been  obtained 
from  a  standard  least  squares  solution  using  all  of  the  data 
simultaneously.  The  estimates  of  the  local  parameters  for 
each  data  set,  except  the  last,  which  were  obtained  as  the 
sequential  solution  was  carried  out  are,  however,  not  correct 
(because  their  estimation  did  not  use  the  information  about 
the  global  parameters  which  would  be  obtained  from  future  data 
sets)  . 

To  recover  the  estimates  of  the  local  parameters  a  back 
solution  is  run.  To  construct  the  back  solution  we  use  the 
property  of  sequential  least  squares  solutions,  that  the  local 
parameters  for  the  last  data  arc  used  are  recovered  correctly 
(because  all  of  the  information  about  the  global  parameters 
has  been  included  in  solution).  To  use  this  property  in  the 
back  solution  we  essentially  remove  the  contribution  of  a  data 
arc  from  the  (full)  global  solution  and  then  add  it  back  in  as 


if  it  were  the  last  arc  in  the  solution.  This  may  sound  like 
a  sleight  of  hand,  but  the  technique  should  work. 

The  implementation  of  this  technique  is  relatively 
straight-forward  but  is  more  complicated  than  the  forward 
solution . 

To  see  how  the  back  solution  is  carried  out  we  will  run 
through  the  matrix  manipulations  needed  to  process  one  arc 
during  the  back  solution.  The  relationship  between  the 
observations  and  the  parameters  (separated  into  local  and 
global  parameters,  ax^  and  AXg,  respectively)  will  be  (from 
Equation  E.2) 


(E. 21) 


The  normal  equations  for  the  solution  will  be  given  by 
(including  the  a  priori  covariances  for  global  parameters) , 


|a'V1A  +(Uf  l)~1  Ajv-VllJkV1***  ("a"1)"1^'1 

g  ee  9  g  g  ee  A  9  ee  9  9 


!‘V, 
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A  V  W 
9  ee  X 
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A  e  e  A 
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X  ee 


(E. 22] 


where  (Ng-1)-1  is  the  covariance  matrix  of  the  global  para¬ 
meters  which  would  be  obtained  from  all  of  the  data  excepting 
the  arc  which  is  being  processed  (in  general  this  matrix  is 
not  known) .  In  order  to  reduce  the  number  of  symbols  needed 
in  future  equations,  we  will  write  Equation  (E.22)  as 


(E.23) 


where  Bn  =  <N*- V1 ,  B12  =  B22  =  A^A,, 


5g=AgV“^A?+ (Ng-1) _1 AXg-1 ,  and  z x=A^v"^ A? .  We  should  note  the 


matrices  B^2  and  B22,  and  the  vector  2^  are  known,  but  the 
matrix,  B^,  and  the  vector,  2  ,  are  not  known  (unless  this 


arc  was  the  last  arc  to  be  used  during  the  forward  solution) . 
The  matrix  B^  and  the  vector  2g  may  be  calculated  by  treating 
this  arc  as  if  it  were  the  last  arc  in  the  solution. 


If  Equation  (E.23)  represented  the  last  arc  in  the  solu¬ 
tion,  then  the  inverse  of  the  normal  equations  would  be 


B11  ®12 

ii 

H 

1 

<V  C12 

B12  B22 

C12  C22 

(E. 24 ) 


where  (N*j)  1  is  the  final  global  parameter  covariance  matrix, 
y 


(The  other  elements  in  the  inverse  C12  and  C22  do  not  interest 


us  at  this  time.)  We  may  find  an  expression  for  (Ng)-1  in 


terms  of  B^,  B12  and  B22,  by  performing  a  partitioned  inverse 
of  the  normal  equations.  This  partitioned  inverse  will  be 
given  by  (see,  e.g.,  Beyer,  1978  p.ll) 


<Ng>-1  -  (B11-B12B22BL>>'1- 


By  re-arranging  the  above  equation,  we  can  obtain  an  expres¬ 
sion  for  unknown  matrix  B^ ,  i»e»» 


B12B22B12 ’ 


(E.25) 


To  find  the  unknown  vector  2 


t 


we  use  a  similar  procedure 


From  Equations  (E.22)  and  (E.24) ,  we  have 


<Ng>  C12 

*9 

= 

*9 

C12  C22 

*2 

and  the  estimates  of  the  global  parameters  will  be  given  by 

♦  Cl 2tl  "  *g  <E-26> 

Again  using  the  partitioned  inverse  of  the  left  hand  side  of 
Equation  (E.24) ,  we  can  express  (the  unknown)  C12  matrix  in 
terms  of  the  (known)  matrices  (Equation  B.25)  ,  B22  and 

B 1 2 *  i.e.f 

C12  *  -<Ng»'lB12Bi2  ' 

Therefore 

*_  =  (N^)  _12  -  (N^)”1B1,B:h1 

g  g  g  g  12  22  * 

which  may  be  re-arranged  to  yield 


Now  we  can  substitute  the  calculated  matrix  B. .  and  vector 

11  g 

into  the  normal  equations  and  the  back  solution  can  be  com¬ 
pleted  using  the  standard  SOLVE  software. 

Global  solution  capability  was  added  to  SOLVE  by  adding 
three  programs  to  the  SOLVE  suite  of  programs.  Program  INITL 
will  save  the  global  parameter  adjustments  and  covariance 
matrix  from  any  SOLVE  solution.  Program  APCOV  (Apply 
COVariances)  will  add  the  covariances  saved  by  INITL  to  the 


normal  equations  (this  program  is  automatically  scheduled  by 
SOLVE)  and  APBAK  will  carry  out  the  matrix  multiplications  for 
the  back  solutions  (again  this  program  is  automatically 
scheduled) . 

The  global  solution  software  has  some  limitations 
(imposed  by  the  computer  size).  Currently,  the  maximum  number 
of  parameters  which  can  be  estimated  during  any  single  arc  is 
95.  As  data  from  many  experiments  are  combined  the  number  of 
global  parameters  (source  and  station  positions)  can  get 
sufficiently  large  that  the  95  parameter  limit  for  an  arc  can 
be  exceeded.  There  are  two  ways  to  overcome  this  problem.  The 
simplest  method  is  to  suppress  some  of  the  global  parameters. 
This  could  be  done  for  the  positions  of  radio  sources  which 
are  only  used  in  a  single  experiment.  The  utility  of  this 
technique  is  limited,  however.  An  alternative  technique, 
which  has  not  yet  been  implemented  in  the  SOLVE  programs,  is 
to  combine  separate  global  solutions  together,  i.e.,  if  the 
number  of  global  parameters  becomes  too  large,  a  second  global 
solution  is  started.  When  this  second  solution  is  completed, 
the  two  covariance  matrices  are  combined  to  yield  the  total 
global  sc  ition. 

The  mathematical  model  which  would  be  used  to  combine  the 
solutions  is 

|I|l*illl  =  l*£l  (E.28) 


I 
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g 
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t 

where  0  is  a  rotation  matrix  and  i  is  vector  of  rotation 


^  r*  u- . 


parameters  which  accounts  for  rotations  of  the  crust-fixed  and 
"inertial”  coordinate  systems  (see  Chapter  5)  between  the  two 
global  solutions.  Equation  (E.28)  can  be  solved  using  the 
techniques  which  have  already  been  described. 

Many  of  the  techniques  introduced  in  this  appendix  are 
used  in  other  sections  of  this  thesis  and  the  practical 
implementation  of  these  techniques  may  be  clearer  when  they 
are  discussed  in  these  sections. 


The  Mark  III  VLBI  system  nominally  observes  right- 
circularly  polarized  (RCP)  radiation.  However,  since  perfect 
feed  horns  can  not  be  made  some  amount  of  the  lef t-circularly 
polarized  (LCP)  radiation  from  the  radio  source  will  "leak" 
into  the  output  of  the  nominally  RCP  feed  horn.  In  this 
appendix  we  consider  the  effects  of  such  a  leakage  on  the 
group-  and  phase-delay  measurements. 

In  order  to  examine  the  effects  of  polarization  leakage, 
we  commence  by  writing  the  expressions  for  the  spectra  of  the 
signals  at  the  outputs  of  the  feed  horns  of  the  antennas  which 
form  an  interferometer.  We  do  this  by  writing  a  more  general¬ 
ized  form  of  Equation  (A. 5).  The  spectra  of  the  signals, 

Xj^w)  and  X2(w),  at  the  outputs  of  the  feed  horns  will  be 
given  by  (see  Appendix  A  for  introductory  discussions): 

Xx  (to)  =  R  +  PL  +  (F.l.a) 

X2(<d)  =  KR'  +  KyL'  +  (F.l.b) 

where  R  and  R'  are  the  RCP  components  of  the  signal;  L  and  L' 
are  the  LCP  components  of  the  signal;  8^  and  are  noise 
components;  K  is  a  constant  which  represents  the  difference  in 
sensitivity  of  the  two  antennas  (see  Equation  (A.5));  5  and  y 
are  complex  constants  which  express  the  amount  of  LCP  radia¬ 
tion  leaking  into  the  outputs  of  the  feed  horns.  These 
constants  are  complex  to  allow  for  phase  shifts  of  the  LCP 
signals  relative  to  RCP  signals  after  these  signals  have 
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propagated  through  the  feed  horns.  Equations  (F.l)  are 
derived  from  Equation  (A. 5)  by  explictly  writing  the  signals  S 
and  S'  in  terms  of  their  RCP  and  LCP  components.  For  perfect 
RCP  feed  horns  P  and  y  would  both  be  zero.  We  now  proceed  in 
a  manner  similar  to  the  analysis  which  was  used  in  examining 
Equation  (A. 5).  The  RCP  components  R  and  S'  are  related  to 
each  other  by  (see  Appendix  A  for  details): 

R*  (w)  =  R  (to)  e~la  +  **  (F.  2. a) 

where  a  =  (w-u0)  Tg  +  +  wtpAt;  o>0  is  the  reference  (angular) 

frequency  to  which  the  visibility  phase,  $t,  is  referred; 
is  the  group  delay;  t  is  the  phase  delay  rate;  At  is  the  time 
interval  between  the  epoch  to  which  the  delay  measurements  are 
referred  and  the  temporal  center  of  the  data  used  to  compute 
the  spectra  (see  Equation  (A. 5)  for  more  detailed  discus¬ 
sions)  ;  and  A4>  is  the  difference  of  the  feed  horn  position 
angles  at  the  two  antennas  (see  Appendix  D  for  details) .  The 
difference  of  the  feed  horn  position  angles  was  implicity 
included  in  <t>t  in  the  derivation  of  Equation  (A. 5).  We 
explicity  include  it  here  because  its  effect  on  the  LCP 
signals  is  opposite  signed  to  its  effect  on  the  RCP  signals. 

We  may  write  an  expression  similar  to  Equation  (F.2.a)  for  the 
LCP  signals: 

L’  (U)  «  L  (w)  e"1®  “  A4,  (F.2.b) 

where  we  note  that  the  feed  position  angle,  A<|>,  affects  the 
phase  of  E' (w)  with  opposite  sign  of  its  effect  on  R' (w) . 

The  cross  spectrum  of  the  signals  at  the  output  of  the 


feed  horns  will  be 


X^X*  =  KRR 1  *  +  Ky  *RL ’  *  +  RN^*  +  KgLR 1  *  +  Kg y  *LL ' * 

+  gLN£*  +  KNj^R'*  +  Ky*N^L 1  *  +  N^N^* 

and  the  expectation  of  this  cross  sprectrum  is 

<X1X*>  =  KR2  e_la  +  A<i'  +  K(3 y *L2  e-1®-^  (F.3) 

where  we  have  assumed  that  the  radiation  from  the  radio  source 

* 

is  unpolarized  (i.e.,  <LR  >  ~  0) ,  and  we  have  substituted  the 

relationships  between  the  RCP  and  the  LCP  signals  received  at 

the  two  antennas  (Equations  (F.2)).  This  cross  spectrum  is 

shown  graphically  in  phasor  form  in  Figure  F.l.  We  see  from 

this  figure  that  the  angle  between  the  LCP  and  RCP  phasors  is 
_ * 

2A<j>+arg  ( 0 y  ).  If  the  LCP  signal  is  small  compared  to  the  RCP 
signal,  i.e.,  mag(py  )<<1,  we  may  express  the  error  in  the 
visibilty  phase,  A4> ^ ,  due  to  the  leakage  of  the  LCP  radiation 
as 

A<t>A  =  mag  (By*)  sin(2A<l>  +  arg(gy*)).  (F.4) 


The  group  delay  error  due  to  the  polariztion  leakage  can  be 
obtained  by  differentiating  A4>a  with  respect  to  (angular) 
frequency: 

At^  =  mag(  a(f3y*)/3u  )  sin(2A<l;  +  arg(  b($y*)/dio  1 


which  can  written  in  the  form 


At.  =  At^  sin(2Acp  +  5**) 


where  At£  =  mag(  d(£y  )  /bu  )  and  =  argf  a(gy  ) /Bu>  ). 


We  should  note  that  the  polarization  leakage  will 


/•+  -  •  v  /  V  «V-f.  ‘v  ’ 


(F.5) 


introduce  errors  in  the  group  delay  and  the  visibility  phase 
measurements  only  if  the  feed  horns  at  both  antennas  in  an 
interferometer  are  leaking  LCP  signals,  i.e.,  both  5  and  y 
must  be  non-zero.  For  this  reason,  polarization  leakage  can 
cause  closure  errors  when  two  antennas  in  a  triplet  of  of 
antennas  have  defective  feed  horns.  Only  the  measurements 
with  the  interferometer  which  contains  the  two  antennas  with 
defective  feed  horns  will  be  affected  by  the  LCP  signals.  T 
closure  errors  which  are  believed  to  be  due  to  polarization 
leakage  are  discussed  in  Section  3.3. 
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